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ABSTRACT 

We model mock observations of collisionless TV-body disc-disc mergers with the same 
axisymmetric orbit superposition program that has been used to model elliptical galax- 
ies in Coma. The remnants sample representatively the shape distribution of disc-disc 
mergers, including the most extreme cases, like highly prolate, maximally triaxial and 
dominantly oblate objects. The aim of our study is to better understand how the 
assumption of axial symmetry affects reconstructed masses and stellar motions of sys- 
tems which are intrinsically not axisymmetric, whether the axisymmetry assumption 
then leads to a bias and how such a potential bias can be recognised in models of 
real galaxies. The mass recovery at the half-light radius depends on viewing-angle and 
intrinsic shape: edge-on views allow to reconstruct total masses with an accuracy be- 
tween 20 percent (triaxial/prolate remnants) and 3 percent (oblate remnant). Masses 
of highly flattened, face-on systems are underestimated by up to 50 percent. Deviations 
in local mass densities can be larger where remnants are strongly triaxial or prolate. 
Luminous mass-to-light ratios are sensitive to box orbits in the remnants. Box orbits 
cause the central value of the Gauss-Hermite parameter H4 to vary with viewing- 
angle. Reconstructed luminous mass-to-light ratios, as well as reconstructed central 
masses, follow this variation. Luminous mass-to-light ratios are always underestimated 
(up to a factor of 2.5). Respective dark halos in the models can be overestimated by 
about the same amount, depending again on viewing angle. Reconstructed velocity 
anisotropies (3 depend on viewing angle as well as on the orbital composition of the 
remnant and are mostly accurate to about A/3 = 0.2. Larger deviations can occur 
towards the centre or the outer regions, respectively. We construct TV-body realisa- 
tions of the Schwarzschild models to discuss chaotic orbits and the virial equilibrium 
in our models. In this study we explore the extreme limits of axisymmetric models. 
Apparently flattened, rotating ellipticals of intermediate mass are likely close to both, 
axial symmetry and edge-on orientation. Our results imply that Schwarzschild models 
allow a reconstruction of their masses and stellar anisotropies with high accuracy. 

Key words: galaxies: kinematics and dynamics - galaxies: elliptical and lenticular, 
cD - galaxies: formation 



1 INTRODUCTION 

Subject of this paper is the reconstruction of synthetic dy- 
namical systems - iV-body merger remnants - with orbit 
models. The motivation behind is to better understand mod- 
els of real dynamical systems, especially those of elliptical 
galaxies. 

Elliptical galaxies are optically smooth stellar sys- 
tems in approximate dynamical equilibrium. They can re- 

* E-mail:jthomas@mpe. mpg.de 

f E-mail: jesseit@usm.uni-muenchen.de 



suit from various kinds of merging p rocesses - for ex- 
ampl e the merging of two discs (e.g. iToomre fc Toomrel 
1972f) - or from some kind of monolithic collaps e (e.g. 
Eggen, Lvnden-Bell fc Sandagei 1 19621 : iLarsonl 1 1974 ). In a 
cosmological context an early-type can go through sev- 
eral distinct pha ses of the above prototypical forms (e.g. 
iNaab et alj|2007h . 

Apart from following the cosmic evolution of poten- 
tial progenitor systems the only way to determine ellipti- 
cal galaxy evolutionary histories is to scan their present 
structure for characteristic fingerprints of different evolu- 
tionary events. This concerns both, scaling relations of el- 



2 J. Thomas, R. Jesseit, T. Naab, R. P. Saglia, A. Burkert and R. Bender 



lipticals as a class and internal properties of individual 
systems. For example, the cold collapse of a stellar sys- 
tem results in a typical gradient from central isotropy to 
strong outer radial anisotropy in stellar orbits (|van Albadal 
1982). Galaxy mergers, on the other hand, can produce a 
variety of dynamical systems. The final structure of disc- 
disc merger re mnants depen d s, for example, on pr ogeni- 
tor properties dBarnes! Il992l; iHernquistl Il992l. Il993| ). the 
merging geometry (IWeil fc Hernauistlll996ljDubinskil 



19981) 



and on the mass ratio of the progen itors dNaab fc Burkertl 
120031 ; I Jesseit. Naab fc Burkertoosl). Ellipticals as progen- 



itors can be merged as well ( Naab. Khochfar fc Burkertl 
I2006T ). 

The difficulty with real galaxies is that their intrinsic 
properties, like the intrinsic shape, the distribution of mass 
or the geometry of stellar orbits, are not directly observ- 
able. They have to be inferred from observations through 
dynamical modeling. 

The state-of-the-art method for such model- 
ing is Schwarzsch ild's orbit superposition technique 
|Schwarzschildlll979r i. In very rough terms (1) the photom- 
etry of a galaxy is deprojected into the 3d internal light 
distribution; (2) the light distribution is multiplied with the 
stellar mass-to-light ratio and - depending on the specific 
application - a black hole or dark halo is added to obtain 
the composite mass distribution; (3) thousands of orbits 
are calculated in the resulting gravitational potential; (4) 
the orbits are added together to fit the kinematic and 
photometric observations of the galaxy. Thereby each orbit 
is weighted individually to optimise the match with the 
data. 

Schwarzschild's method can be read as a numerical im- 
plementation of Jeans' theorem, which states that station- 
ary distribution functions (the density of stars in six dimen- 
sional phase-space) of collisionless sys tems are necessarily 
funct ions of the integrals of motion fe.g. lBinnev fc Tremainel 
1987). In other words - since integrals of motion label or- 
bits (and vice versa) - the phase-space density in a station- 
ary system is constant along individual orbits. This explains 
why the fundamental building blocks of stationary dynami- 
cal systems are entire orbits and no density variation along 
individual orbits needs to be considered. In principle then, 
the only assumption underlying Schwarzschild modeling is 
that galaxies are stationary and collisionless. 

In practice, however, applications also assume a spe- 
cific internal symmetry for each object under study. This is 
to reduce the degrees of freedom in the deprojection and to 
simplify the sampling of phase-space with orbits. Axial sym- 
metry is the simplest geometry to account for intrinsic flat- 
tening, inclination effects, rotation, and the presence of disc- 
like subsystems in real galaxies. Several implementations 
of Schwarzschild's met hod for axially symmetric potentials 
have been developed dCretton et alj [l999l; iG ebhardt et alj 
200d;lHafner et alj|200d: IValluri. Merritt fc Emsellemlbool 



Thomas et al.ll2004l ; ICappellari et alj|2006l ) and have been 



used to analyse surveys of elliptical galaxy kinematics 
llGebhardt et alj|2003l ; ICappellari et al . 2006; T homas et al.l 
120071 ). 

Recovery of synthetic axisym metric test models 
from either idealised noiseless data (|Cretton et al ] 1 19991 ; 
iKrainovic" et al l 120051 ) or from realistic noisy kinematics 
1 Thomas et al.ll2005l ) has proven an accuracy level of better 



than 10 percent in these models (in cases where the recon- 
struction is reasonably well defined). Concerning applica- 
tions to real galaxies, however, the distribution of apparent 
ellipticities, isophotal twists and/or minor-axis rotation in- 
dicate that elliptica l s cann o t be exactly axisymmetric (e.g 



Bertola fc Gallettal Il979l; iFranx. Illingworth fc Heckman 
1989 1; IJedrzeiewski fc Schechterl Il989l ; iTremblav fc Merritt 
1996). Up to now, it is not clear how such intrinsic devi- 
ations from rotational symmetry in real galaxies affect the 
results of axisymmetric dynamical modeling. 

In this paper, we present first results of a project aimed 
to systematically survey the properties of axisymmetric 
Schwarzschild models that are applied to non-axisymmetric 
test objects. Specifically, we imitate realistic photometric 
and kinematical observations (realistic in terms of spatial 
coverage and resolution) of collisionless A-body merger rem- 
nants and fit them with the same Schwarzsc hild code that 
has b een used for a study of Coma ellipticals (|Thomas et alj 
2007). We determine internal mass distributions and veloc- 
ity anisotropies just as for real galaxies, but since we know 
the corresponding properties of our test objects, we can ex- 
amine the models. 

The final goal of our project is twofold. Firstly, we want 
to explore possible systematic deviations that are caused 
by applying axisymmetric models to objects that do not 
respect any internal symmetry. Thereby, we want to under- 
stand how such deviations can be recognised when modeling 
a real galaxy, whose internal structure is not known a priori. 
Collisionless disc mergers are ideal for such a study, because 
they represent physically motivated dynamical systems that 
cover a large range of intrinsic shapes and dynamical struc- 
tures. 

By investigating how intrinsically non-axisymmetric 
systems are mapped onto axisymmetric models we also 
gain templates for the interpretation of real galaxy mod- 
els. A second goal of our study is therefore to compare 
the resulting Schwarzschild models of merger remnants with 
Schwarzschild models of real galaxies. Since we use the same 
modeling code in both cases, differences are indicative for 
structural differences between galaxies and the analysed 
merger remnants. Knowing such differences allows a deeper 
understanding of the physical mechanisms involved in ellip- 
tical galaxy build-up. 

The present paper focusses on the first part of the 
project. A detailed discussion of the results with respect 
to observations and models of real galaxies is planed for a 
future publication. We further plan to extend our survey of 
Schwarzschild models to samples of mergers involving gas 
physics and/or dynamical systems developing from cosmo- 
logical initial conditions. 

The paper is organised as follows: Sec. [2] describes the 
sample of merger remnants used for this work. Our imple- 
mentation of Schwarzschild's technique is reviewed in Sec.0 
Sec. [4] summarises tests with a Hernquist sphere. The mod- 
eling results are detailed in Sec. [5] (general notes), Sec. [6] 
(reconstructed masses) and Sec. [7] (reconstructed velocity 
anisotropies). In Sec. [8] we discuss various modeling uncer- 
tainties. Sec. [9] deals with the viewing-angle dependency of 
the total mass recovery and Sec.[T0]discusses the relation be- 
tween reconstructed luminous mass-to-light ratios and the 
central orbital structure of the merger remnants. Implica- 
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tions for models of real galaxies are briefly discussed in 
Sec. 1111 The paper closes with a summary in Sec. 1121 



2 MERGER SAMPLE 

A careful selection of the sample of merger remnants is 
crucial for our study. The merger remnants for this paper 
are taken from the coll isionless disc-disc merger sample of 
iNaab fc Burkertl J2003). Their progenitor galaxies consist 
of exponential discs and Hernquist bulges with a bulge- 
to-disc ratio of 1:3, embedded in pseudo-isothermal dark 
matter halos such that the overall circular velocity curve 
is approximately flat in the outer parts. With respect to 
their global kinematical and photometrical properties, these 
merge r remnants resemble giant ellipticals of intermediate 
mass l|Naab fe TruiillcfeOOfj ). For further details on the gen- 
eral properties and numerical details of ho w the merger rem- 
nants have formed we refer the reader to INaab fc Burkert] 
|2003t ). 

Different merger remnants result from different merging 
geometries, but here we select them only according to their 
shape and do not care how they have formed. As the shape 
of the merger remnants is very closely correla ted to their or- 
bital content (|jesseit. Naab fc Bur kert 2005), we know that 
sampling different shapes ensures that we explore a range of 
different orbital makeups as well. 



2.1 Orbital composition and shape of remnants 

According to rotational symmetry, all orbits in axisymmetric 
potentials conserve the ^-component L z of angular momen- 
tum and are minor axis tubes, or Z-tubes. Such Z-tubes can 
have various shapes between equatoria l-radial, equator ial- 
circular, shell-like and polar-radial fe.g. iRic hstonc 198^). 

In triaxial d ynamical system s we will expect more or- 
bit classes (e.g. de Zeeuwl ll985T ). In particular box orbits 



(most frequent in the centre) without net angular momen- 
tum, boxlets (resonant boxes found at larger radii) and in- 
ner and outer major axis tubes (also X-tubes in the follow- 
ing). Major-axis tubes have significant angular momentum 
around the long axis. As in axisymmetric potentials, also 
triaxial force fields support minor axis tubes, which have a 
non-zero angular momentum with respect to the short axis. 
The abundances of different orbit classes will depend on the 
exact shape of the merger remnant. 

The shape is determined by the ratio of the three prin- 
cipal axes of the inertial tensor calculated from the particle 
positions in the merger remnants. The main axes are de- 
noted: X (long), Y (intermediate) and Z (short) respectively. 
The corresponding values of the inertial tensor are a, b and 
c, respectively. 

2.2 Sample selection 

We choose six remnants that representatively sample the 
range of shapes realis ed by the collisionless mergers of 
INaab fc Burkertl (|2003l 1. including the most extreme cases: 
(1) a box-orbit dominated remnant (TRIAX); (2) one with 
a high X-tube fraction (PROLATE); (3) a nearly round ob- 
ject (ROUND); (4, 5) two very flattened remnants with dif- 
ferent inner shape profiles (FLAT and ELONG); (6) one 
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Figure 1. Projected cllipticitics of merger remnants. From top 
to bottom: X-projection, Y-projection and Z-projection. Radii are 
scaled by the half-light radius r e ff of the respective projection. 
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Table 1. Selected merger remnants. (1) Merger remnant; 
(2) progeni tor mass ratio; (3 ) me rging geometry according to 
Tab. 1 of INaab fc Burkertl d2003h ; (4-6) abundances of ma- 
jor orbit classes among the 40% most bound particles (from 
Ijesseit. Naab fc Burkertj|2005h . Irregular orbits and orbits with- 



out classification are not included in the table. 



oblate remnant, dominated by Z-tubes (OBLATE). Model- 
ing of further remnants from this sample would bring little 
additional information. 

Table [T] summarises orbital abundances. Respective el- 
lipticity profiles ex, £y and ez that result from projecting 
the remnants along the three principal axes X, Y and Z are 
shown in Fig. [1] 

The Schwarzschild models considered in this work al- 
ways assume oblate axial symmetry. Concerning projected 
ellipticities, axial symmetry implies either 



cz 



0, 



6y (oblate) 



ex = 0, ey = ez (prolate) 



(1) 



(2) 



Fig- Q] reveals that one of the six modelled merger remnants 
is consistent with oblate axial symmetry (OBLATE), while 
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Figure 2. Internal axis ratios of the merger remnants shown in 
Fig. [T1 as function of the fraction N^/Ntot of particles with the 
highest binding energy. From top to bottom: c/b, c/a and b/a. 



two others are marginally consistent with prolate axial sym- 
metry (ROUND, PROLATE). 

Generally, the remnant sample of iNaab fc Burkert] 

(2003) is deviant from oblate rotational symmetry in the 
inner regions. This can be inferred from Fig. [2] which shows 
profiles of internal axis ratios, calculated from the spatial 
distribution of the JVe most bound (luminous) particles 

(Ntot is the total number of luminous particles in the rem- 

I II 1\ 

nant; cf. Jess eit, Naab fc B urkert 2005). In terms of intrinsic 

axis ratios, oblate axial symmetry implies c < a and a = b. 



2.3 Mock Observations 

We will model the 'observational' data of the projections 
along the three principal axes for each remnant. Almost 
all observational properties, photometric or kinematic, will 
reach their maximum or minimum values at these projec- 
tions. This is so because the principal axes are also the sym- 
metry axes of the various orbit classes which means that par- 
ticles will move perpendicular or parallel to one of the axes, 
depending on orbit class and projection. Consequently, the 
moments of the line-of-sight velocity distribution (LOSVD) 
will reach extreme values for the respective projections. Sim- 
ilar reasoning can be applied to the p hotometric properties, 
e.g. t he isophotal shape parameter 04 dBender &: Mollenhofj 
1 19871 ). Concerning Z-tubes in triaxial potentials, for exam- 
ple, 04 reaches its extreme values in the long-axis projection 
(most boxy) and intermediate-axis projection ( most disky), 
respectively (cf. I Jesseit. Naab fc Burkertl [20051 for detailed 
discussion). In summary: by modeling the principal projec- 
tions, we are testing the extreme cases, where the influence 
of certain orbit classes on the observables is visible (or not). 
Influence of the viewing angle on our results will be discussed 
in detail in the following sections. 

As we want to make meaningful statements about the 



recovery of real galaxy structure, we try to imitate obser- 
vational conditions o f a comparison samp le of Coma el- 
lipticals modelled by iThomas et alj l|2007h with the same 
Schwarzschild code. This comparison sample consists of 
long-slit major and minor axis spectra l|Mehlert et al.ll2000l : 
IWegner et al.l [2002). The photometry for the Coma galaxy 
models is constructed as a composite of HST (centre) an d 
ground based imaging (outer parts; cf. IThomas et al1l2005l ). 

The photometry of the iV-body mergers is obtained for 
two different resolutions as well. The coarse one is mimick- 
ing ground based observations and extends to large radii. 
The one with higher resolution simulates HST data at small 
radii. We include seeing by smoothing the particle distribu- 
tion with a Gaussian of width 1/20 r e s (roughly the size of 
the numerical softening parameter) in the high resolution 
case. In the coarse resolution case the seeing amounts to 
three quarters of the effective radius. Low and high resolu- 
tion photometr y are combine d into one continuous profile, 
as described in IThomas et ail (|2005T ). 

To match the observational conditions of the Coma 
comparison sample the kinematic information is extracted at 
an intermediate resolution (smoothing width l'.'2 ~ l/6r e ff) 
along the apparent photometric major and minor axes (cf. 
Sec. l3.2p . We have kinematic dat a out to about two effective 
radii (see lNaab fc Burkertl I200II for extensive discussion on 
how artificial observations are performed on N-body rem- 
nants) . 



3 SCHWARZSCHILD MODELING 



Our S chwarzschild models are described in IThomas et alj 
l|2004 ) where a thorough discussion of the modeling imple- 
mentation is given. 



3.1 Model setup 

In the following, we briefly recall the basic steps of the 
Schwarzschild technique: 

(i) The surface brightness of each remnant-projection is 
deprojected at three inclination angles: the edge-on depro- 
jection probes the inclination where the Schwarzschild model 
is least flattened. The other extreme, the most flattened 
Schwarzschild model, is constructed at an inclination an- 
gle, for which the deprojection appears as an E7 galaxy 
(when seen from the side). Inclination angles resulting in 
intrinsically even flatter models are unreasonable because 
(1) ellipticals flatter than E7 are not observed and (2) rem- 
nants flatter than E7 are not in our merger remnant sample. 
As an intermediate case we also probe an inclination angle 
that leads to a Schwarzschild model resembling an E5 galaxy 
(when seen from the side). 

The luminosity distribution v is assumed to be axisym- 
metric and we include surface-brightness, ellipticity and 04- 
profilcs in the deprojectio n, which is perfor med with the 
non-parametric program of lMagorrianl |l999T l. 

(ii) Based on the deprojected luminosity- profile v a mass 
distribution is constructed via 

P = Ti/ + pdm, (3) 
where T determines the amount of mass that follows 
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the light (we will denote T the stellar mass-to-light ra- 
tio in the following). For the additional dark matter den- 
sity pdm we adopt a Navarro-Frenk- White (NFW) pro- 
file i^nd_tli*j^lation_Vjetween concentration and mass given 
in iNavarro. Frenk fc White] (|l99fj ). The dark halos of the 
merger remnants do not follow these profiles exactly. The 
progenitor galaxies are embedded in pseudo-isothermal ha- 
los with a flat central density core. After the merging the 
central dark matter slope steepens, but is still shallower than 
in NFW-profiles (cf. Sec. EH)) . 

Our choice for NFW-halos is motivated by results of 
Monte-Carlo simulations showing that one can always find a 
NFW-halo among the above introduced family which mim- 
ics an (non-singular) isothermal distrib ution sufficiently we ll 
over the radial region considered here l|Thomas et al.| [2005). 
Moreover, for a few remnant projections we have also calcu- 
lated cored logarithmic halos, and the results do not change 
significantly (cf. Sec. I6,2[l . Then, since we do not loose gen- 
erality, it is convenient to use the one-parameter family of 
NFW-halos. To explore possible effects of halo shapes we 
model each halo once with a spherical mass distribution 
and once with a flattening of the density distribution of 
c/b = c/a = 0.7, where a, b and c are the long, intermediate 
and short-axis of the halo mass distribution, respectively. 
The halos of the merger remnants are close to oblate-axial 
symmetry, with b/a > 0.9 and 0.7 ^ c/a ^ 1.0. 

With the mass density fixed, the gravitational potential 
$ follows by solving Poisson's equation. 

(iii) In the gravitational potential $ a representative set 
of orbits is calculated. The o rbit sampling is described in 
detail in lThomas et all l|2004h . 

(iv) In the final step the orbits are superposed to fit 
the photometric and kinematical constraints. The m aximum 
entropy-technique of IRichstone fc Tremaind (| 1988T > is ap- 
plied and the kinematic data is fitted by solving for the 



maximum of 



S- ax 



S is the entropy of the model and 



EE 

3=1 fc=l 



AC: 



(4) 



(5) 



measures the difference b etween input LOSV Ds £ m and 
model LOSVDs £ mod (see |jh omas et al ] |2004l for more de- 
tails about the calculation of S and £ in this context). Each 
LOSVD is binned into N vc \ velocity-bins and the input data 
consists of Nc LOSVDs in total (cf. Sec. 13.21 for further 
details). The luminosity density is treated as a boundary 
condition to equation Q. The regularisation parameter a 
in equation Q allows to control the relative importance 
of ^-minimisation (fit to data) and entropy-maximisation 
(smoothness of the distribution function). 

In the following we will consider two cases for a. Firstly, 
models obtained with a = will be called .Umax-models, be- 
cause for q == the x 2 -term vanishes and the orbital weights 
are entirely determined by the maximisation of S (under the 
boundary condition related to v). In order to fit an orbit li- 
brary to a given set of kinematical data, a has to be positive. 
The larger a, the better the fit will be. In case of real ob- 
servations, very large a can result in models that fit the 
noise in the data. Concerning our merger remnant fits, we 



assume that the input data is not affected significantly by 
noise (cf. next Sec. I3.2p . Therefore, as the second case for 
a, we consider a value large enough such that the minimum 
of x 2 is reached (x m i n -models). This usually occurs around 
a m iQ. Larger a do not change x 2 or other model properties 
significantly. 



3.2 Definition of x 2 

To solve equation Q one needs to evaluate the x 2_ f erm 
and, thus, to specify the ACf^ of equation ((5|. Insofar as 
the TV-body simulations are viewed as a discrete TV-particle 
realisation of an underlying continuous phase-space distribu- 
tion function, the mock observations should be interpreted 
to have some intrinsic Poisson scatter that decreases with 
increasing the number of particles. This describes the case 
of Sec. [4] where we test our modeling machinery with an TV- 
body representation of a Hernquist sphere. It is also valid 
for the setup of the progenitor systems. In both cases, the 
TV-body system is an imperfect representation of an un- 
derlying continuous phase-space distribution function. The 
merger remnants, however, are not such an TV-body sam- 
pling of some unknown distribution function. Instead, they 
just reflect the dynamical evolution of TV particles from their 
particular initial conditions - irrespective of how these have 
been constructed. In this sense, after the relaxation induced 
by the merging, we treat the mock observations as 'ideal' 
observations of a discrete (TV ~ 10 5 particle) dynamical 
system, that we try to represent by Schwarzschild models. 
There is no obvious way to define AC(^ in this case, however. 

For a statistical analysis the proper way to proceed is 
to add random fluctuations to the raw observations. The re- 
sulting noisy 'data' together with the 'error bars' from which 
the noise has been constructed provide a statistically con- 
sistent input to the models. However, our merger sample is 
small and it would be necessary to model several random 
realisations of the original raw data in order to avoid any 
influence of a particular noise pattern on the results. This 
is computationally too expansive as it means to model ef- 
fectively dozens of data sets. Moreover, it is not the goal 
of this study to quantify uncertainties that originate from 
observational errors (which has been done elsewhere, e.g. 
IThomas et~aT]|2005h . Instead, our aim is to explore possible 
systematic biases arising when treating non-axisymmetric 
objects with axisymmetric models. Therefore, we setup our 
model input as follows. 

First, Gauss-Hermite moments v, a, H3 and Hi 
(|Gerhard|[l993l ; lvan der Marel fc Franxlll993h of the m erger 
remnants are calculated as in iNaab fc BurkertJ (|200ll ). The 
Gauss-Hermite moments are then used to calculate the 
LOSVDs £i n at a set of radii typical for our comparison 
sample of Coma ellipticals. Corresponding observational er- 
rors of Coma galaxies at these radii are scaled to the mock 
dat£0- The Gauss-Hermite 'error-bars' are propagated into 



1 The exact value we will use is a = 0.9143 and arises from the 
iterati ve solution of equation 1(3} ; see, for example. IThomas et al.l 

iooi). 

2 We use fractional errors in v and a but absolute errors for H3 
and H4 . As template to create the error-bars we use the observa- 
tions of NGC4807, which are proto-typical for the Coma sample 
in terms of radial coverage and signal-to-noise. 
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ACf^ by means of Monte-Carlo simulations. The resulting 
LOS VDs £in ± A£j n are used as input for the Schwarzschild 
models without adding noise explicitly. Neglecting the noise 
makes uncertainties of derived model quantities (masses, in- 
ternal velocity moments) unreliable. But for our purpose of 
identifying systematic trends it is only important to flag a 
bestfit model in a similar way as a bestfit model is deter- 
mined for a real galaxy. The role of A£? n is to specify the 
relative weight of different data points. The usage of error 
bars from real observations ensures that in our models data 
from different spatial regions are weighted similar as in mod- 
els of real galaxies. 

3.3 The bestfit model 

To obtain the bestfit dynamical model we calculate 
Schwarzschild models on a grid in the two-dimensional pa- 
rameter space (T,c). Thereby we probe 0.3 ^ T ^ 1.3 with 
AT = 0.1 and 2.5 < c < 30.0 with Ac = 2.5. For each 
pair (T, c) on the grid one model is calculated with a spher- 
ical halo and another one with a halo flattening of q = 0.7. 
The procedure is repeated for up to three inclinations (cf. 
Sec. [31}. 

For this first set of models we use a coarse library setup 
with 2 x 3500 or bits, roughly half the number used to model 
Coma ellipticals (|Thomas et al.|[2007l ) and roughly twice the 
number that ha s been used by the Nu ker team for models of 
galaxy centres (Gcbhar dt et alJl200Ch . Among the low res- 
olution models one, say with parameters (T / , c/ , qf) , yields 
the lowest \ 2 ■ Around these parameter values we recalculate 
models with a larger number of orbit s (2 x 9000 orbits a s 
used for models of Coma galaxies by iThomas et alj l2007h . 
The overall bestfit model is chosen among these high reso- 
lution fits according to the minimum of \ 2 ■ 

For the high resolution models we adjust the modeling 
strategy as follows. (1) As it will be discussed in Sec. 15.31 
the best-fit (low resolution) model is always at an inclination 
i = 90° . For the high resolution case we therefore only con- 
sider edge-on geometries. (2) We examine the same grid for 
(Y,q) as in the low-resolution case, but restrict concentra- 
tions around c/ , usually probing the region between cj — 2 Ac 
and Cf + 2 Ac. If necessary we extent the concentration in- 
terval such that the bestfit high resolution model never oc- 
curs at the boundary of the sampled parameter space. When 
resampling with a larger number of orbits, we vary the halo- 
concentrations in smaller steps of Ac = 1.0. 

We do not find systematic differences between the mod- 
els with 2x3500 orbits and those with 2x9000 orbits, respec- 
tively. For example, twelve out of eighteen best-fit luminous 
mass-to-light ratios T nt are the same in low-resolution and 
high-resolution models. In the remaining cases they change 
by ATflt = 0.1 (four models) and ATj t = 0.2 (two models), 
respectively. There is no preferred direction for the change 
ATflt. 



4 VALIDATION: A HERNQUIST SPHERE 

To check all conversions from iV-body systems to 
Schwarzschild models and back we first model a self- 
consistent Hernquist sphere l|Hernquistll9 90) : we sample the 
isotropic Hernquist distribution function with N = 1.6 x 10 5 
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Figure 3. Schwarzschild models for a Hernquist sphere. In each 
panel ten solid grey lines represent models for ten Monte-Carlo 
samplings of a Hernquist sphere. The analytically derived refer- 
ence profiles of the Hernquist sphere are shown by black, dashed 
lines. Three upper rows: radial (ay, top), meridional (tr$, mid- 
dle) and azimuthal lower) velocity dispersions (luminosity- 
weighted, spherical averages); bottom row: density. Left column: 



-fits; right column: S max -models (details in the text). 



particles and 'observe' the resulting JV-body realisation in 
exactly the same way as the merger remnants. The num- 
ber of particles N = 1.6 x 10 5 resembles the number of 
luminous particles in the l:l-merger remnants, analysed 
later on. For the goal of verifying our machinery by recon- 
structing the Hernquist sphere we modify the modeling pro- 
cedure as follows: (1) We only consider an inclination of 
i — 90°, such that the deprojection is unique. (2) We only 
fit self-consistent Schwarzschild models, because the Hern- 
quist sphere is setup self-consistently (without dark matter) . 
Finally, in order to evaluate the influence of noise in the N- 
body representation, we combine Schwarzschild fits to ten 
different Monte-Carlo realisations of the Hernquist sphere. 
The only free mass parameter in this test run is the stellar 
mass-to- light ratio T. 

Application of our Schwarzschild models to the Hern- 
quist sphere yields Tfl t /Ti n = 0.993 ±0.037, where Tfl t and 
Ti n are the mass-to-light ratio of the Schwarzschild mod- 
els (averaged over fits to ten realisations of the Hernquist 
sphere) and the iV-body input, respectively. The quoted un- 
certainties reflect the variance about the mean. 

Internal velocity moments of Schwarzschild fits are 
shown in Fig. [3] together w ith the analytic profiles for the 
isotropic Hernquist sphere l|Hernauistj|l990l ). Results from 
Schwarzschild modeling are spherically averaged. For the 
Xmi n -models in the left panels of Fig. [3] a regularisation pa- 



rameter a w 1 has been used (cf. Sec. 13.11 the same value 
is used for the merger remnant fits). Apart from some noise 
in the Schwarzschild models, the overall agreement between 
analytic calculations and Schwarzschild fits is very good. 
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To understand whether the scatter in Tfi t /T m and 
the internal moments originates from uncertainties in the 
Schwarzschild code or whether it comes from noise in the 
JV-body realisation, we also tried to reconstruct the Hern- 
quist sphere by a method that is independent from noise 
in the JV-body kinematics: the solution of equation ((4]) for 
a = 0. As stated in Sec. 13. II with a = the x 2 -term vanishes 
and the orbit distribution is determined entirely by maximis- 
ing its entropy (.Umax-model). The idea behind considering 
•Umax-models here is the following: the maximisation of S 
yields, in a sense, the smoothest distribution function (DF) 
for the given density profile. Assuming that this smoothing 
isotropises stellar velocities then the .Umax-model would be 
identical to the (unique) isotropic DF, which is connected 
to any self-consistent spherical density profile. Maximising S 
would therefore determine the orbital weights (and internal 
moments etc.) of our orbit representation of the Hernquist 
sphere without any fit to the kinematics. 

Since the orbital weights in the Smax-models are fixed, 
the only degree of freedom is the velocity scale T. Results 
of the corresponding fits are shown in the right panels of 
Fig. [3] As can be seen, the internal moments of the Smax- 
models follow closely the analytic profiles, confirming the 
above speculations about the connection between entropy 
and isotropy in spherical systems. That the .Umax-models 
in fact match better with the analytical Hernquist profiles 
than the fits on the left implies that the scatter in the fits is 
mainly caused by noise in the TV-body LOSVDs. Uncertain- 
ties in the Schwarzschild code (finite number of orbits and 
finite numerical resolution) are instead negligible, as other- 
wise deviations between reference moments and orbit repre- 
sentation would be larger. Likewise, since the Smax-models 
in the right panels of Fig. [3] are based upon the deprojected 
JV-body light-profiles, noise in the JV-body light-profiles is 
also not the dominant driver for scatter in the left panels. 

Concerning mass-to-light ratios we find Tfl t /Ti n = 
1.007±0.016 in the mean over all ten Smax-models. As stated 
above, the remaining scatter of about 1.5 percent is due to 
noise in the JV-body kinematics. We do not expect this scat- 
ter to have a significant influence on our results of fits to the 
merger remnants. 



5 SCHWARZSCHILD FITS OF MERGER 
REMNANTS: GENERAL NOTES 

Now to the models of simulated merger remnants. This 
section contains notes on general properties of the 
Schwarzschild fits and the deprojections. 

5.1 Luminosity densities 

Fig. [4] compares the axisymmetric deprojections with the 
internal luminosity density profiles of the merger remnants. 
The figure only compares densities along the projected 
major-axis. Results along other position angles are similar. 
For the merger remnants, the density is averaged over a plan- 
parallel wedge of size Ar ~ 0.05 r c g along the major-axis, 
Az w 0.2r o fi perpendicular to this axis (in the plane of the 
sky) and A<p = 45° in the plane defined by the line-of-sight 
and the projected major-axis. 

If a remnant is seen along its long-axis (left panels), 
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Figure 4. Luminosity density of merger remnants (black, dashed) 
and merger models (coloured, solid). From left to right: models 
of X-projections, Y-projections and Z-projections. Densities are 
evaluated along the projected major-axis. 

then the axisymmetric deprojection overestimates the den- 
sity - especially near the centre. The opposite occurs if a 
merger is seen along the intermediate axis (middle panels): 
the axisymmetric deprojection of the Y-projections under- 
estimates the remnant density. Note that for the remnant 
in the bottom row (OBLATE) X and Y-deprojections are 
almost equal, consistent with its oblate shape (b w a). 

Fig. [5] illustrates that the viewing-angle dependency 
of the deprojections reflects the intrinsic non-oblateness 
b/a 7^ 1 of most of our merger remnantfl The light in- 
side an ellipse with b < a, if seen along the long-axis, is 
quenched into the region r < b in the axisymmetric depro- 
jection. Accordingly, the mean density of the deprojection 
inside b must be larger than the original density inside the 
same spatial region. Conversely, if the ellipse is viewed side 
on, the axisymmetric deprojection stretches the light into 
the larger region r < a and, hence, underestimates the true 
density. 

Concerning our merger remnants, deviations between 
deprojection and intrinsic light profile are largest where b/a 
is smallest (cf. Fig. [5]) - in accordance with the above rea- 
soning. At large radii, the intermediate-to-long axis ratio be- 
comes b/a ~ 1 and the deprojections of X and Y-projections 
approach the luminosity profiles of the remnants. 

Concerning the short axis, c/a quantifies the quenching 
of light along the line-of-sight as much as b/a quantifies it 
along the intermediate axis. Insofar, the Z-projection is simi- 
lar to the Y-projection, which explains why Z-deprojections 
underestimate luminosity densities of the mergers as well. 
A difference arises at large radii because b/a — » 1, whereas 
c/a stays roughly constant (e.g. FLAT, ELONG, OBLATE). 

3 We restrict the discussion to the edge-on case, since all our 
bestfit Schwarzschild models have i = 90° (cf. Sec. 15.3b . 
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Figure 5. Schematic view onto the (X, Y)-plane of a prolate 
body (b/a < 1). If the body is seen along the X-axis, then a 
deprojection assuming axial symmetry with the symmetry axis 
being perpendicular to the (X, Y)-plane, overestimates the den- 
sity inside r < b (small circle, red). Correspondingly, if the body 
is seen from the Y-axis, an axisymmetric deprojection underesti- 
mates the density inside r < a (large circle, blue). 



Consequently, Z-deprojections deviate over the whole radial 
range plotted in Fig. [4] and have a steeper slope than the 
luminosity profiles of the mergers. 
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Figure 6. Residuals between Schwarzschild fits and remnant 
LOSVDs normalised to the assigned error-bars: dv = (tifl t — 
Uj n )/A«i n (and analogously for a, H3 and H4). Left: major- 
axis; right: minor-axis; red/solid: X-projections; blue/dashed: Y- 
projections; green/dotted: Z-projections. 



5.2 Kinematic fits 

Because the merger remnants do not obey oblate axial sym- 
metry it is not clear whether their kinematics can be fit by 
our models - which respect this symmetry - at all. Resid- 
uals in the kinematic fits are shown in Fig. [6] Except from 
minor-axis rotation v and asymmetry of the LOSVD H3 
Schwarzschild models reproduce the data very well, to an 
accuracy of about a tenth of the assigned 'error bars'. Since 
these 'error bars' are taken from observations, a comparable 
degree of triaxiality in real galaxies would be hardly recog- 
nisable in terms of a systematic offset between models and 
data. 

Discrepancies between merger remnants and 
Schwarzschild models in minor-axis profiles of v and 
H3 are the result of oblate axial symmetry enforcing 
v = H3 = in the models. Hence, the upper-right panel 
of Fig. [6] in fact shows the amount of minor-axis rotation 
in the remnants. Neglecting the latter in our fits implies 
that part of the kinetic energy of the merger remnants is 
missing in the Schwarzschild models. This could lead to 
an underestimation of the mass. However, the minor-axis 
rotation in Fig. [6] is of the order of the assigned error bars 
(dv < 1), e.g. below 10 percent of the kinetic energy in the 
dispersion (cf. radial profiles of v, a, H3 and H4 and their 
assigned errors in App.|B)). We therefore do not expect that 
neglecting minor-axis rotation of the merger remnants has 
a dominant effect on our results. 

5.3 Inclinations of bestfit models 

Although we probe models at three different inclinations for 
each remnant projection, the bestfit model (with 2 x 3500 or- 
bits; cf. Sec. 13. 3p always occurs at an inclination of i = 90° 



(edge-on). This is not surprising for X and Y-projections. 
However, Z-projections could have been expected to be bet- 
ter represented by nearly face-on models, e.g. with i ~ 0°. 

However, according to the lower panel of Fig. [1] all rem- 
nants except the OBLATE one appear flattened when pro- 
jected along the Z-axis (ez > 0). Axisymmetric models, on 
the other hand, are necessarily round when seen along the 
axis of symmetry. Thus, an axisymmetric i — 0° model can- 
not fit the Z-projection of most remnants. 

Only one remnant (OBLATE) is close enough to ax- 
ial symmetry that its Z-projection is almost round. Why 
is the bestfit model for this remnant again achieved for 
i = 90°? The main reason is probably the small rotation 
signal » / and H3 7^ along the apparent major axis of 
its Z-projection (the face-on view is not exactly round, cf. 
Fig. [1]). At a viewing angle of i — 90° the model can adjust 
the balance between prograde and retrograde orbits to fit 
v 7^ and Hz 7^ 0. Instead, any rotation and asymmetric 
deviation from a Gaussian LOSVD disappear when looking 
at an axisymmetric system face-on: v = H% = (for all 
position angles). Thus, everything else being equal, a face- 
on model will necessarily have a larger \ 2 than an edge-on 
model. In fact 83 % of the Ay 2 between the bestfit edge-on 
and the bestfit face-on modejj of the OBLATE remnant, re- 
spectively, is due to differences in the fit to v and H3. This 
is not a proof, but a strong indication that the residual ro- 

4 The deprojectio n of axisymmetric bodies at i = 0° is in- 
finitely uncertain (iRvbickil [l987l; iGerhard fc Binnevl Il996l) . For 
the above comparison a face-on model has been constructed us- 
ing the (i = 90°) deprojection of the X-projection. If the rem- 
nant would be exactly axisymmetric, then this deprojection would 
uniquely recover its intrinsic luminosity distribution. 
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tation in the Z-projection of the OBLATE remnant is the 
main driver for the bestfit model to occur at an inclination 
of i = 90°. 

A triaxial dynamical system can exhibit various degrees 
of rotation in the Z-projection. If this indeed causes the 
corresponding axisymmetric fit to prefer an inclination of 
i = 90°, then the inclination mismatch is an unavoidable 
consequence of the false symmetry assumption. Concerning 
models of real galaxies, an additional complication enters 
by measurement errors: even for an exactly axisymmetric 
face-on galaxy one would determine v 7^ and Hj, 7^ 
due to measurement uncertainties. In such a best- 
fit axisymmetric inclination of 90° would be an artifact re- 
lated to the ability of the modeling machinery to fit the 
noise in the data. Proper regularisation could provide a way 
out of the inclination mismatch then. For the Z-models of 
the OBLATE remnant we find indeed a best-fit inclination 
i = 0° for a < 0.005 (strong regularisation). 

A systematic investigation of the question whether noise 
in real data can bias axisymmetric models towards i = 90° 
and whether this possible bias can be reduced by using 
proper regularisation is out of the scope of this paper. For 
simplicity, we adopt the same regularisation scheme to all 
merger remnants in the following. We expect this to signif- 
icantly affect only the fits to the face-on projection of the 
OBLATE remnant. Specifically, assuming the wrong incli- 
nation makes our Z-model worse than it could possibly be 
with optimised regularisation. In all other remnants the in- 
clination mismatch is due to intrinsic non-axisymmetry. 



6 MASS DISTRIBUTION IN REMNANTS 
AND MODELS 

Having discussed general features of the Schwarzschild mod- 
els we now turn to the comparison of the mass distribution 
in models and the corresponding merger remnants. 



6.1 Stellar mass-to-light ratio 

Fig. [7] shows the distribution of (scaled) mass-to- light ratios 
Tfit/Tin obtained from our bestfit Schwarzschild models. 
The reconstructed Tat systematically underestimate Ti n . 
Seventeen out of the eighteen models have mass-to-light 
ratios in the range 0.5 ^ T nt /Ti n ^ 0.9, one model has 
1" at /Tin = 0.4. Separating the results according to the view- 
ing angle yields that models of X-projections (shortly X- 
models below) recover the true mass-to-light ratio very well 
(T n t = 0.9 Ti n in all but one case; cf. upper-right panel 
of Fig. [7}. In contrast, Schwarzschild models of Y and Z- 
projections have mass-to-light ratios distributed almost ho- 
mogeneously in the range T nt /Ti n = 0.5 — 0.9 (bottom pan- 
els of Fig. [7}. Although the luminosity density of the depro- 
jection predicts less light in the corresponding models than 
there is in the merger. The reason for this behaviour will be 
further discussed in Sees. [9] and \W\ 

It should be restated that our mock observations are 
not drawn from random projections. Therefore, Fig. [7] does 
not equal the distribution of mass-to-light ratios that would 
result from modeling real galaxies (even if they would be 
structurally similar to the merger remnants). The most sig- 
nificant result here is that axisymmetric models tend to un- 
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Figure 7. Distribution of bestfit mass-to-light ratios T nt (scaled 
by the input value Ti n ). Top left panel: whole sample; other 
panels: results for different projections (indicated in each panel; 
dashed lines: total distribution for comparison). 



derestimate the mass fraction that follows the light. We have 
no proof for the generality of this result, but since we have 
modelled all three principal projections for each remnant we 
do not expect models from other viewing angles to deliver 

Tfit > Tin. 



6.2 Mass densities 

Our Schwarzschild models (and the merger remnants as 
well) contain both luminous as well as dark mass and T only 
represents a fraction of the total mass. The next question is 
how well total and dark matter density profiles are repre- 
sented in the Schwarzschild models. To explore this, Figs. [8] 
- [TO] survey radial density profiles of models and remnants 
separately for the three principal projections. The figures 
show intrinsic densities along the projected major axis. The 
middle panels (luminous mass density) differ from the v- 
profiles of Fig. [4] only in the scaling (the stellar mass density 
equals Tfit x v\ cf. equation [3} . 

Evidently, in X-models not only the luminosity density, 
but also the total mass in the inner regions is overestimated. 
Exceptional is the X-model of the OBLATE remnant: be- 
cause the remnant is close to axial symmetry, no overesti- 
mation of the central density occurs. In Y and Z-models - 
parallel to the underestimation of the light - also the total 
mass density is underestimated. Again exceptional is the Y- 
model of the OBLATE remnant: the total mass is well recov- 
ered. This reflects again the axial symmetry of the remnant, 
according to which X and Y-projections are equivalent and 
both should allow a good reconstruction with our models. 

The case of the OBLATE remnant also reveals a slight 
degeneracy in the mass recovery. The best-fit X-model has 
Tfit = 0.7, while the best-fit Y-model is obtained with Tfit = 
0.9. Despite these different Tat, the total mass inside r e « is 
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Figure 8. Comparison of remnant (black/dashed) and 
Schwarzschild model (coloured/solid) mass-density profiles (along 
projected major-axis). Left/middle/right: total/luminous/dark 
mass. The figure surveys results from modeling X-projections. 

recovered with high accuracy in both models: 2.8 percent 
fractional accuracy in the X-model and 0.4 percent in the 
Y-model, respectively. Thus, the total mass can be recovered 
with about the same accuracy, even if luminous masses differ 
by about 20 percent. 

Independent of projection, central dark matter densities 
are overestimated in all Schwarzschild models. Most likely, 
this reflects our choice of NFW-profiles for the halos of the 
models (cf. Sec. 13. ip . In principle, an overestimation of the 
central dark matter density could cause an underestimation 
of the luminous mass for compensation. Near the centre, 
where the dark matter excess is most prominent, the lu- 
minous matter is, however, still a factor of ten larger than 
the dark matter density (at 0.1r e ft, for example). We there- 
fore do not expect the central over-prediction of dark matter 
to be important for the recovery of T. Moreover, while the 
dark matter excess is projection- independent, the underesti- 
mation of Tflt/Ti n is projection- dependent. 

Nevertheless we have additionally calculated a set 
of logarithmic (LOG) halos for one merger remnant 
( OBLATE; the grid u sed to sample the halos is described 
m iTh omas et alJl2007h . In case of the X and Y-projection 
LOG-halos allow a slightly better fit than NFW-halos (cf. 
bottom-right panels of Figs. [8] and [9}. As will become clear 
from the discussions in the next sections, these models are in 
no respect systematically different from the models of other 
remnants which are calculated with NFW-halos. 

We have also calculated logarithmic halos for the X- 
models of the TRIAX, PROLATE and ELONG remnants. 
In these cases as well as concerning the Z-model of the 
OBLATE remnant, LOG halos do not provide better fits. 
As a consequence, considering LOG-halos does not change 
Tflt of this models. It follows that the particular choice of 
the halo profile (between NFW and LOG) has little effect 
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Figure 9. As Fig. [8] but for Y-projections. 
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Figure 10. As Fig. [8] but for Z-projections. 



on our results. It merely influences the match to the dark 
matter component in a spatial region, where dark matter is 
a minor contributor to the total mass. 

Towards the outer edge of the kinematical data (r c s < 
t % 2r c ff), mass densities of Schwarzschild models and 
merger remnants agree reasonably well. This holds for the 
total mass, as well as for luminous and dark components, 
separately. Around 1 — 2r e «, integrated total masses of 
Schwarzschild models are accurate to about 20 percent. The 
Z-models of the most flattened remnants (FLAT, ELONG, 
OBLATE) are deviant by up to 40-50 percent. 
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Figure 11. Comparison of velocity anisotropics. Left: meridional 
J3$ of Schwarzschild models (solid/coloured) and of merger rem- 
nants (short-dashed/black); right: azimuthal fi v of Schwarzschild 
models (long-dashed/coloured) and of merger remnants (short- 
dashed/black); grey: ±0.2 absolute deviations from merger values 
for comparison. The figure shows results of X-modcls. 

7 VELOCITY ANISOTROPIES IN REMNANTS 
AND MODELS 

We now consider in more detail the internal dynamical struc- 
ture of the merger remnants and how it is represented by our 
Schwarzschild fits. 

7.1 Anisotropy profiles 

Figs. If f I- If 31 compare profiles of meridional anisotropy 

& = 1 - 2| (6) 

and azimuthal anisotropy 

^ " 1 - i r (7) 

of Schwarzschild models and merger remnants. We use 
spherical coordinates r, ■& and <p, oriented along the prin- 
cipal axes such that cp is the azimuth in the (X, Y)-plane 
and ■& is the latitude. The velocity dispersions are luminos- 
ity weighted spherical averages. 

In Figs.[TT1-[T3lA/3 = ±0.2 margins are highlighted. The 
choice of these margins is arbitrary, and is only to guide a 
quantification of deviations between mergers and models. In 
most cases these are smaller than A/3 < 0.2. But there are 
some outliers (mostly among Z- models). As a general rule, 
X and Y models fit better with the intrinsic properties of 
the merger remnants than Z-models. 

The mismatch of the Z-models is partly due to the fact 
that the bestfit Schwarzschild models are always achieved 
for an inclination i = 90° (cf. Sec. I5.3|l . Apart from the re- 
lated mismatch in the deprojection it raises a complication 
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Figure 12. As Fig. 1111 but for Y-models. 




Figure 13. As Fig. [TT] but for Z-models. 



concerning the comparison of the internal moments: in the 
Z-projection of a merger remnant, according to the above 
definitions, the azimuth tp appears as the angle in the plane 
of the sky. In the Schwarzschild models, however, t? as de- 
fined above is the angle in the plane of the sky, as long as 
i = 90°. 

Much of the discrepancies between Schwarzschild mod- 
els and merger remnants can be attributed to these different 
coordinate definitions. To show this Fig. [14] replicates the 
same profiles as Fig. If3l but j3 v of the Schwarzschild mod- 
els is now compared to f3$ of the merger remnants and vice 
versa. The differences between the mergers and the models 
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Figure 14. As Fig. 1131 but Schwarzschild (3 V (long-dashed) are 
compared to merger remnant (left panels) and Schwarzschild 
Ptf (solid) are compared with f3 v of the remnants (right panels). 



are significantly smaller in Fig. [I4]than in Fig. 1131 especially 
among the most strongly flattened remnants. 



7.2 Interpretation in terms of orbits 

The remaining deviations between the anisotropy profiles of 
merger remnants and their corresponding Schwarzschild fits 
are most likely related to the different orbit families sup- 
ported by TV-body potentials on the one side and axisym- 
metric potentials on the other. 

Figs. [15] and [16] review principal projections of orbits 
numerically integrated in an JV-body potential (Fig. I15|l and 
in an axisymmetric potential (Fig. I16|l . Regions with v\ os > 
v± are plotted dark and regions with ui os < v± are plotted 
grey. Thereby v\ os is the absolute line-of-sight velocity in 
the given projection and v± is the absolute magnitude of 
the velocity perpendicular to the line-of-sight. In dark areas 
most of the kinetic energy of an orbit is directed towards the 
observer, whereas in grey areas most of the kinetic energy is 
in motion perpendicular to the line-of-sight. 

The tangential anisotropy of the X-model for the PRO- 
LATE remnant can be explained by the dominance of X- 
tubes in this remnant. According to Fig. I15l their round ap- 
pearance in the X-projection makes them most similar to the 
edge-on projection of axisymmetric shell orbits (cf. Fig. ll6[) . 
The latter, in turn, have large ct$ and low oy and cause the 
tangential anisotropy in the Schwarzschild model. 

Likewise, the similarity of f3$ in Z-models with j3 v of the 
merger remnants discussed at the end of Sec. I7.1l can be ex- 
plained by the fact that the dominant orbits in the outer 
parts of merger remnants, Z-tubes, appear nearly round 
when seen face-on. Again, they are likely mapped onto ax- 
isymmetric shell orbits, with the same consequence for the 
model's anisotropy as discussed for the X-model of the PRO- 
LATE remnant. 



Figure 15. Numerically integrated orbits in an iV-body poten- 
tial. From left to right: box orbit, X-tube and Z-tube. From top to 
bottom: X, Y and Z-projection. Coordinate definitions are illus- 
trated on the left hand side. Black: v\ os > v± ; light grey: v\ os < v± 
(details in the text). 
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Figure 16. As Fig. 1151 but for the case of an axisymmetric po- 
tential. From left to right: radial, shell-like and nearly circular 
orbit. 



8 MODELING UNCERTAINTIES 

Up to now we have presented the viewing-angle depen- 
dency of the masses and anisotropics which we reconstructed 
with our axisymmetric orbit models. The behaviour of the 
anisotropy could be explained by the way in which projected 
properties of major remnant orbit families match with dif- 
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ferent axisymmetric orbits. The recovered masses are less 
easy to understand, in particular the low Tat- This section 
and the following two Sees. [9] and [10] are aimed to discuss 
the mass recovery in more detail. We start this discussion 
here by investigating whether the projection-dependency of 
the mass-recovery in our axisymmetric dynamical models is 
an artifact of the modeling machinery. 

8.1 Stationarity assumption 

As it has been stated in the introduction, the Schwarzschild 
method is based on Jeans' theorem and the assumption that 
the object to be modelled is stationary. Non-stationarity of 
the merger remnants can have a significant influence on the 
recovered masses. For example, if a remnant contracts be- 
cause the ratio of its kinetic and potential energies is smaller 
than in virial equilibrium then a stationary model could de- 
liver a mass smaller than the true one. Likewise, if a remnant 
expands then the recovered mass-to-light ratio could be too 
large. 

Stationarity or virial equilibrium, respectively, implies 

that 



2T l3 = -Wi h 



(8) 



where Ty denotes the kinetic energy tensor and Wij de- 
notes the potential energy tensor. Equation |8j holds for 
the luminous and the dark components separately, if both 
are stationary. In the following we only consider the lumi- 
nous component. The calculation of its kinetic and potential 
energies is straight forward: 



T, 



■ («) • (<*) 
a = l 



0) 



where the sum extends over all the iV; luminous particles of 
the merger remnant (with mass m a each) and a^ a ' is the i- 
component of the velocity of particle a. The potential energy 
of the luminous component comprises the two contributions 



where 



W (ll) + W 



(id) 



Ni-l 



;(<*/3) 



W^ = -G^m a £ 



0=a + l 



and 



(«)_ x f) )(a >)_^h 



(10) 



(11) 



(12) 



J 3 1 

The sum extends over luminous particles only and r a p is 
the distance between particles a and /3, respectively. The 
contribution of dark matter comes in through 

N, M d ,( a /3) 

(13) 



w (id) 



a=l 0=1 



' u/3 



In the last sum Afd denotes the total number of dark matter 
particles in the remnant (with mass Mp each). In virial 
equilibrium total potential energy W = ^2 Wa and total 
kinetic energy T = Y, T " obe y 2T/\W\ = 1 

The six modelled merger remnants have 2T/\W\ £ 
[0.960, 0.981] with the lowest value for the FLAT remnant 
and the largest value for the OBLATE one. Thus, the rem- 
nants are very close to virial equilibrium and we expect that 



the assumption of stationarity in the models should affect 
the models' masses at most at the 5 percent level. Hence it 
is not the main driver for the low Tfl t /Ti n in our models. In 
addition Tfl t /Ti n is projection-dependent whereas 2T/\W\ 
is projection-independent. 



8.2 Phase space sampling 

Another potential uncertainty in the modeling procedure is 
the difference in phase-space structure of merger remnants 
on the one side and Schwarzschild models on the other: while 
the remnants are composed of a relatively large number of 
particles, each sampling a different orbit at one point, the 
Schwarzschild model is composed of a relatively low number 
of orbits, each sampled very densely (we use about 10 5 time- 
steps for each orbit integration). 

Concerning the sampling of the orbit (the time-step 
and total integration time used), our implementation of 
Schwarzschild's method has been successfully tested on con- 
tinuous analytical dynamical models, like for example a 
Hernquist sphere (|Thomas et al.| [2004). To check whether 
a similarly good agreement can be achieved when modeling 
iV-body systems, we have repeated the tests with discrete 
iV-body realisations as modeling targets (cf. Sec. 3}. The 
small uncertainties that we find imply that differences in 
phase-space structure are negligible. 



8.3 Chaotic orbits 

In the implementation of Schwarzschild's method applied 
here (as in most others) chaotic orbits are treated in 
the same way as regular orbits. This is not necessary in 
Schwarzschild models, but makes them computationally 
more efficient. A chaotic region in phase-space at fixed E 
and L z has to have a constant phase-space density accord- 
ing to Jeans' theorem. If such a region is represented by one 
(chaotic) orbit in the library, then the method works fine. 
However, it may happen that the (finite) integration time of 
the first orbit that is launched in the chaotic region is insuf- 
ficient to cover the accessible phase-space volume entirely. 
Then the program will launch one or more other orbits to 
fill up the rest of the chaotic region. It is then likely (al- 
though not necessary) that these fractional orbits will have 
different phase-space densities in the final model. As a conse- 
quence, the model no longer satisfies Jeans' theorem. Several 
suggestions have been made to overcome th is problem (e.g. 
iMerritt fc Fridman|[l996l : lHafner et al.ll2000l L 

Since the main consequence of chaos in phase-space is 
to break the stationarity of the Schwarzschild models, it 
should manifest itself in deviations from virial equilibrium 
and, thus, can be quantified by evaluating the virial equa- 
tions of the Schwarzschild models. To calculate the kinetic 
energy tensors Tij and Wij defined in equations (J5J) and 
(|10l) we have constructed TV-body realisations of each best- 
fit Schwarzschild model as described in App. lAl 

For the obtained virial ratios we find 2T/\W\ « 1 to 
within 15 percent. This limits the amount of chaos in our 
orbit libraries. Deviations from 2T/\W\ = 1 are not corre- 
lated with viewing-angle but with halo-concentration, which 
is an artifact of the iV-body realisation and further discussed 
in App. [A] Thus, the margins for intrinsic non-stationarity 
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are even smaller than the above quoted 15 percent. The re- 
lated uncertainties are not sufficient to explain the trends in 
the mass recovery. 



9 THE VIEWING ANGLE DEPENDENCY OF 
THE TOTAL MASS RECOVERY 

The last section has ruled out modeling uncertainties as the 
main source for the magnitude and projection-dependency of 
the mass recovery. We now investigate whether the different 
3-dimensional shapes of models and merger remnants are 
the main driver of this dependency. 

Globally, mass and kinetic energy are linked by the 
virial theorem, 2T — \W\ = kM, where n depends on the 
density profile. Accordingly, reconstructed masses M (flt) and 
input masses are related via 

(14) 



„(flt) 



1 v ' T J- DM 



with t' 6 '- 1 and T-fy^ denoting the kinetic energy of lumi- 
nous and dark matter in the Schwarzschild fit and T^ in ' and 
^dm being the analogue quantities of the merger remnant, 
respectively. Since it is basically T^ fit ' that is constrained 
by the LOSVD-fits, it is instructive to study first the en- 
ergy budget of the Schwarzschild models in comparison to 
the merger remnants. Then, equation (|14[) can be used to 
evaluate the implications on the reconstructed masses. 

Because the virial theorem relates energies to total 
masses this section deals with the viewing-angle dependency 
of the total mass recovery. The mass-to-light ratio of the 
stellar component will be discussed separately in the next 
Sec. [10] 



9.1 Energy budget of the Schwarzschild models 

In the merger remnants, by definition of the axes (cf. Sec. [5} 
T xx ^ T yy T zz , whereas oblate axial symmetry implies 
T xx = Tyy ^ T zz in the Schwarzschild fitfl In the following, 
it is convenient to switch from axis-labels referring to the 
intrinsic shape of the remnant (e.g. X, Y and Z as defined in 
Sec. [2} to projection-based labels: let us define the kinetic 
energy Ti os as the energy parallel to the axis that points 
towards the observer, T ma j as the energy parallel to the axis 
that projects to the apparent major-axis and T m i n as the 
energy directed parallel to the apparent minor-axis. 

Figs, im and 1181 show that the line-of-sight energy Ti oa 
and the ratio Tmm/T ma j of the two transversal energies 
are well recovered by the Schwarzschild models. This could 
have been expected since Ti os is the energy mapped by 
the projected kinematics. A mismatch in Ti oa should mani- 
fest itself in the kinematic fits. Some scatter remains, how- 
ever, because we do not assume full sky-coverage with kine- 
matic data. That the Schwarzschild models match also with 
Tmin/Tmaj of the remnants is plausible, because this energy 



5 We will only consider the diagonal elements Ta in the following, 
because in the merger remnants as well as in the TV-body real- 
isations of the Schwarzschild models the other components are 
at least two orders of magnitude lower and, thus, energetically 
negligible. 
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Figure 17. Correlation between kinetic energies of merger rem- 
nants (in) and Schwarzschild models (fit; along the line-of-sight). 
Dotted line: one-to-one relation for comparison. 
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models (fit), respectively. Dotted: one-to-one relation for com- 
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ratio determines the shape (e.g. iBinnev fc Tremainelll987f ). 
And the shape is accounted for in the Schwarzschild fits 
through the deprojected luminosity density, which is used 
as a boundary condition for our models. 

The two relations revealed by Figs. I17l and ll8l have sev- 
eral important implications for the energy budget and, thus, 
the recovered masses of the Schwarzschild models. 
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To see this, let's assume for simplicity that 

(fit) _ rp(iix) 



T 
and 



m(l» 
J los 



(fit) 



(in) 



-.(fit) 



y(in) 
maj 



(15) 



(16) 



(in other words, we replace the two approximate one-to-one 
correlations of Figs. 1171 and [181 by identities). By symmetry 



(fit) 



maj 
(fit) 



?i™ in the Schwarzschild fits and it follows 



r, 



los 
(in) 



los 



Moreover, according to equation (|16[) 



min maj 



rpfin) 
(fit) -'min 
T (in) ■ 
maj 



and by equation (|17[l : 



rp(Rt) _ j>(in) - 

min — los ^(in) ' 



(in) 



(17) 



(18) 



(19) 



Hence, the two relations (|15[) and ()16[) uniquely link the 
three relevant components of the kinetic energy tensor of 
the Schwarzschild model to the energy components of the 
merger remnant via equations (|15[) , (|17[) and (|19[) . Note that 
this holds only for edge-on models. If a model is calculated 
at i = 0° , then T^*' = by symmetry. In this case, even 

the identity T^ 4 ' = does not constrain the two ratios 
^V^min and T^/T^, respectively. This reflects the 
uncertainty in the flattening along the line-of-sight of the 
model. 

To express the expected energy budget for a 
Schwarzschild fit of a given merger projection more quan- 
titatively, it is necessary to figure out to which intrinsic 
axes Ti oa , T ma j and T m i n correspond. Simple algebra leads 
to Tab. [2] in which Schwarzschild model energies relative 
to remnant energies are given explicitly for each projection. 
The table allows to draw some important conclusions. In 
fact, for X-models the energy in the Schwarzschild model 
has to be larger than in the remnant, /T^ ^ 1 ; unless 
the remnant is oblate-axisymmetric. Contrary, in Y and Z- 
models T (flt) /T (in) sC 1 and the energy in the Schwarzschild 
model has to be smaller than in the remnant. 
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Table 2. Ratios of kinetic energies of Schwarzschild fits and 
merger remnants according to equations 11151 . II17I I and 1191 . 
(1) projection; (2) line-of-sight energy; (3) projected long-axis 
energy; (4) projected short-axis energy; (5) total energy (T = 
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Figure 19. Accuracy of reconstructed total (luminous + dark) 
mass inside r c g versus ratio T( flt )/T( m ) of total kinetic energies 
in Schwarzschild fits and merger remnants. 



9.2 Mass budget of the Schwarzschild models 

What are the consequences for the masses of the 
Schwarzschild fits? For simplicity let's first assume that 
Then, equation (|14p predicts that the ratio 
j^(fit)^^-(m) Q £ recons t ruc ted and input masses equals the 
ratio of the corresponding total kinetic energies (luminous 
+ dark). However, due to the lack of kinematic informa- 
tion about the constituents of the dark halo, the models 

have no access to the dark matter kinetic energy of the rem- 
rfiti 

nants and T£, M is not constrained. As a consequence, the 
results on the energy budget are significant for the com- 
parison of reconstructed and input masses only inside a 
radius where luminous matter dominates, for example in- 
side r c ff. There, the contribution of the dark matter ki- 
netic energy is small. Assuming that its contribution is in 



fact negligible, then equation (|14|l and the last column of 
Tab. [2] imply M (flt) /A/ (in) ^ 1 inside r off for X-models and 
M (flt) /M (in) ^ 1 for Y and Z-models. The amount by which 
Af < - flt - ) exceeds M^ m ' in X-models should be comparable to 
the amount by which M^ 4 ' is reduced relative to M' m ' in Y- 
models and masses of Z-models should be smaller than those 
of Y-models. Fig. [19] relates the mass ratio M (flt) /M (in) 
(at the effective radius) to kinetic energies 

T (flt)/ T (in) 

globally confirms the just discussed trends between recon- 
structed masses and kinetic energies. 

Hence, at the half-light radius, masses of our 
Schwarzschild fits are closely related to the energy budget 
of the models. The energy, in turn, derives from the fit to 
the kinematics and the shape of the modeling target via 
relations (|15[1 and (|16[) . F° r edge-on systems, the restric- 
tions imposed by the assumption of axial symmetry together 
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with these two relations already uniquely determine the lu- 
minous kinetic energy and, hence, the mass budget of the 
axisymmetric fits. Thereby it turns out that X-models have 
to overestimate the true mass, while Y and Z-models have 
to underestimate it. 

We close this section with a few further comments on 
Fig. 1191 According to equation (|14[1 . the relation between re- 
constructed masses and reconstructed energies can be tilted 
with respect to a one-to-one relation if the K-ratio varies sys- 
tematically over the sample. In Sec. 15.11 we have discussed 
systematic variations of the deprojections with viewing- 
angle which could cause the tilt with respect to the one-to- 
one case revealed by Fig. 1191 In addition there is a correla- 
tion of the dark halo properties in the models with remnant 
projection, which could also contribute to the tilt in Fig. [19] 
(cf. Sec. I10.3[) . On top of that, the fact that we compare 
luminous kinetic energies with masses inside r e B adds to the 
uncertainties in the step from T (flt) /T (in) to M (flt) /M (in) 
and can also tilt the relation or increase its scatter. Other 
sources of scatter are scatter in the relations (1151) and (1161). 



For example, the two Y-models with M (flt) /M (in) > 1 in 
Fig. [19] correspond to the two Y-models above the one-to- 
one relation in Fig. 1171 



10 CENTRAL REMNANT STRUCTURE AND 
THE LUMINOUS MASS-TO-LIGHT RATIO 

In this section we discuss the results for the modelled lumi- 
nous mass-to-light ratios. In contrast to the total masses the 
disagreement between models and mergers cannot be simply 
traced to the energy budget. 

The upper panel of Fig. 1201 shows T n t/Ti n versus total 
kinetic energy T (flt) /T (in) - The scatter in Fig. [20] is much 
larger than in the corresponding Fig. [I9]which deals with to- 
tal masses. As expected from the tightness of the correlation 
in Fig. 1191 deviations from a one-to-one correlation between 
Tfit/Ti n and T (fit) /T (in) are correlated with the dark matter 
content of the fits: where Tfl t /Ti n is too low, the dark matter 
in the models overestimates the dark matter in the remnant 
and vice versa. A larger scatter in Fig. [20] than in Fig. [19] 
is not surprising because the reconstruction of the mass de- 
composition is less certain than the reconstruction of the 
total mass: there is some freedom in the modeling to shift 
mass from the luminous to the dark component (and vice 
versa) without changing the fit significantly. There are how- 
ever two striking trends: (1) Tg t is generally smaller than 
Ti n and (2) at a given value of T (flt) /T (in) Y-models suffer 
from a slightly stronger underestimation of T than models 
of other projections. 

The systematics in the lower panel of Fig. [20] indicate 
that Tat, unlike the total mass, is not merely set by the 
total kinetic energy but must instead depend on something 
else. To investigate this further, we have redetermined best- 
fit Schwarzschild models (with 2 x 3500 orbits; cf. Sec. I3.3[) 
under the condition that the luminous mass-to-light ratio is 
fixed to the true value of the mergers, T = Tin- The corre- 
sponding fits are illustrated in Fig. [21] in the same way as 
fits with optimised T nt are shown in Fig. [6] The most im- 
portant result is that Y and Z-models constrained to have 
the true mass-to-light ratio fail to fit the central kinematics: 
they predict too large central velocity dispersions. X-models 
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Figure 20. Top: Accuracy of reconstructed luminous mass-to- 
light ratio T n t/Ti n versus ratio T' flt '/T^ m ) of total kinetic ener- 
gies in Schwarzschild fits and merger remnants. Bottom: Devia- 
tions A = T( fit ) /T^ 111 ' — T nt /T; n from a one-to-one correlation in 
the top panel versus accuracy of reconstructed dark mass (inside 
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Figure 21. As Fig. [6] but for Schwarzschild models with T 
T 

i in* 



in Fig. [21] are only marginally different from those in Fig. [6] 
because most X-models yield T nt ~ Tj„ to within ten per- 
cent in any case. 



Schwarzs child models of N -body merger remnants 17 



10.1 The relation between flattening and central 
kinematics 

That axisymmetric models of Y and Z-projections with 
Tflt = Ti n overpredict the velocity dispersion somewhere 
along the equator, can be qualitatively understood as fol- 
lows. The flattening of an axisymmetric body always comes 
along with some excess of energy parallel to the (X, Y)- 
plane. According to rotational symmetry, this excess energy 
is equally distributed in X and Y, respectively. Consequently, 
the projected kinetic energy in directions where the object 
looks most flattened is always relatively large. Only details 
of the radial distribution of the projected energy are not 
strongly constrained by the flattening alone. For example, 
if the flattening comes predominantly from near-circular or- 
bits, then the central projected energy can be relatively low 
(circular orbits cross the central line-of-sight with zero line- 
of-sight velocity), while most energy resides in the outer 
regions. If radial equatorial orbits are responsible for the 
flattening, then the central projected e nergy (velocity dis- 
persi on) is relatively large, instead (e.g. iDehnen fc Gerhardl 
Il993h . 

The situation in triaxial bodies is different: again there 
is an excess of energy in the (X, Y)-plane as soon as c < a 
and c < b. But this energy is no longer distributed equally 
between X and Y. If b < a, then there is more energy paral- 
lel to X than parallel to Y. It follows that the Y-projection, 
in which the object is most flattened (one sees the shortest 
and longest axis in projection), has relatively low projected 
kinetic energy. In other words, Y-projections of triaxial sys- 
tems can be highly flattened in combination with a low spe- 
cific projected energy (e.g. line-of-sight energy per mass). 
Now, since the flattening in axisymmetric systems is con- 
nected to a relatively large specific projected energy around 
the equatorial plane, a mismatch of projected dispersions 
somewhere around the equator is plausible (if the masses 
are equal). 

10.2 Flattening by box orbits and T nt 

So far the general situation. The fact that the specific dis- 
persion of the axisymmetric models with T nt = Ti n is too 
large near the centre (cf. Fig. I21[) is most likely connected 
to the specific structure of the here analysed merger rem- 
nants. They become prolate near the centre (cf. Sec. 12. 2p 
and this inner prolateness i s connected to particles movin g 
preferentially on box orbits l|jesseit. Naab fc Bur kert 2005). 
Fig. [21] shows the interplay between intrinsic central shape 
and projected central kinematics. The former is quantified 
by the axis-ratio b/a calculated from the spatial distribu- 
tion of the 10 percent most bound particles and the latter 
is expressed in terms of (#4)0 (calculated inside an aper- 
ture of 2" - about r e ff/3 or 1 kpc at the Coma distance of 
d = 100 Mpc). The figure shows that high (Hi)o occur in 
Y-projections (low line-of-sight velocities), while low or neg- 
ative (Hi)a appear when viewing the prolate centres end- 
on (large line-of-sight velocities) . Differences between X and 
Y-projections increase with decreasing b/a. Finally, as ex- 
pected for nearly prolate systems, Z and Y-projections are 
almost equivalent. 

In principle, the projected central velocity dispersion 
should show analogue trends. However, it also depends on 
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Figure 22. Central (#4)0 in X and Y-projections as function of 
b/a (10 perc ent most bound p articl es). Small symbols: merger 
remnants of iNa ab fc Burkert (2003); large symbols: subset of 
modelled merger remnants in this work; lines trace the mean 
of the distributions. For comparison also the mean of the Z- 
projections is indicated. 



the total mass M and size Rh of a system and has to be 
normalised before different objects can be compared. One 
option is to use 

Mo = y== , (20) 
y/GM/R h 

where Rh is the half-mass radius of the light distribution and 
do is the central velocity dispersion, measured in the same 
aperture as {H^q. The top panel of Fig. l2Ul shows that (Ha)o 
and (<t)o are closely correlated: high (#4)0 come along with 
low projected dispersions and vice versa, as expected from 
our above discussion. In addition, high (H±)o (and low (<r)o, 
respectively) are connected to large projected ellipticities, as 
illustrated in the lower panel of the figure. Concluding, inner 
box orbits in the merger remnants indeed cause a situation 
as described in Sec. ll0.il Y and Z-projections of the remnant 
centres have high flattening in combination with low (cr)o 
and high (1/4)0 ■ 

How are the remnant centres mapped onto the axisym- 
metric Schwarzschild models? Particles on box-orbits stream 
perpendicular to the line-of-sight of Y and Z-projections. 
The respective high H4 values provoke radial anisotropy in 
the inner regions of Y and Z-models (lower panel of Fig. [24)) . 
It has been mentioned above that axisymmetric systems 
which are flattened by equatoria l radial orbits have large 
central velocity dispersions (e.g. IDehnen fc Gerhard! 1 199a ; 
see also the edge-on projection of the axisymmetric radial or- 
bit in Fig. ll6[) . However, the Y and Z-projections of the rem- 
nants are characterised by the opposite: low central velocity 
dispersions. Consequently, Y and Z-models constrained to 
have T = Ti n predict a too large central velocity dispersion 

(cf. Fig. mj. 
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Figure 23. Central {^4)0 versus {(t)o (top) and projected 
cllipticity eo.i of the 10 percent most-bound particles (bot- 
tom). Small symbols: all remnants; large symbols: remnants with 
Schwarzschild models. 



Assuming that the anisotropy in the models is fixed by 
the constraints imposed through H4, then there is only one 
way to match the low central dispersion of the remnants: 
to reduce the inner mass. The inner mass, in turn, depends 
on only one parameter, T, because both, mergers and mod- 
els, are virtually free of dark matter in their centres. Ac- 
cordingly, if it is indeed the flattening by box orbits in the 
merger remnants, e.g. the combination of high ellipticity, 
positive {Hi)o but low (cr)o that causes the low Tg t in Y and 
Z-models, then we would expect the following behaviour of 
the models: the larger {Hi)o, the larger the radial anisotropy 
in the Schwarzschild fits. The larger the anisotropy, in turn, 
the larger (a)o for T = Tj n . Thus, we expect that models 
which are forced to be more radially anisotropic will have 
a lower Tat f° r compensation. The upper panel of Fig. fJH 
confirms that indeed the lowest Yfi t /Yi n appear in models 
that are most strongly radially anisotropic near the centre 
(at 0.2r cff ). 

Hence, the systematically low Yfu of Y and Z-models in 
Fig.[20]most likely reflect a lack of appropriate counter-parts 
of box orbits in axisymmetric potentials that can support a 
high flattening in combination with a low central velocity 
dispersion (per mass). 

The underestimation of Y in Y and Z-models is at first 
glance similar to the underestimation of the total mass in 
these models (cf. Sec. l9.2p . However, models of X-projections 
also deliver Yfit < Ti n . This is different from the reconstruc- 
tion of total masses, which are instead overestimated in X- 
models. On the one side, it should be noted that - irrespec- 
tive of Yfit < Yin - the total mass of X-models, even close 
to the centre, is often larger than in the remnants because of 
the overestimation of light in the deprojection (cf. Sec. 15. ip . 
On the other hand, the discussion of the Y and Z-models has 
revealed that Yfit depends primarily on the central orbital 



Figure 24. Meridional anisotropy at r = 0.2r o fj versus 
Tfit/Ti n (top) and versus H4 at same radius (bottom). 



structure and not, as the total mass, on the global energy 
budget. Therefore the reconstruction of Y is different from 
the reconstruction of the total mass. In particular, it most 
likely depends on the specific structure of the here consid- 
ered merger remnants. We cannot rule out that systems exist 
in which, say, {H±)o and the inner flattening combine in a 
way such that X-models would be forced to have Yat > Yi n . 
This needs further exploration of a broader sample of mod- 
eling targets. The X-projections of the here analysed merger 
remnants apparently do not require Tat > Yi n to be mod- 
elled adequately. 

10.3 The role played by dark matter 

In Fig. [24] Y and Z-models behave similarly (low Y nt , large 
radial anisotropy) as expected from the central box orbits 
in the merger remnants. In Fig. 1201 however, Y-models are 
different from Z-models in their dark halos: the underesti- 
mation of luminous mass in Y-models is compensated for 
by relatively massive halos, while this is apparently not 
the case in Z-models. According to Sec. [9] higher projected 
kinetic energies in Y-projections are responsible for their 
higher total masses (compared to Z-models) . The difference 
between Y and Z-models could be related to the different 
views on min or-axis tubes, that dominate th e outer parts of 
the remnants (I Jesseit. Naab fc BurkertT2 005). Y-projections 
map them edge-on such that they contribute significantly to 
the total line-of-sight energy. In Z-projection they appear 
face-on and most of their kinetic energy is perpendicular 
to the line-of-sight. The transition from side-on inner box- 
orbits to edge-on outer Z-tubes may be the origin for the 
local maximum in some dispersion profiles of Y-projections 
(TRIAX, FLAT; cf. App.jB). Z-projections lack of a similar 
maximum, as expected if it is caused by the edge-on view 
on Z-tubes. The increase of a at the transition from box or- 
bits to Z-tubes could explain why the low Yfit of Y-models 
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(set by the fit to the central remnant kinematics) have to be 
compensated for with massive halos at larger radii. Z-models 
do not need such compensating halo components, because a 
drops smoothly. 

It appears that for our modeling the inclusion of dark 
matter in the fits has two main effects: (1) it allows to trace 
the true mass structure of the modeling targets in the outer 
parts (because the remnants contain dark matter); (2) in 
some cases it can improve the fit to the central kinematics 
by allowing for an artificial (A//L)-gradient over the spatial 
region dominated by luminous mass. In particular it offers 
to combine low central T with larger outer M/L. This sec- 
ond issue related to dark matter in the models raises the 
question whether fitting self-consistent triaxial systems with 
self-consistent axisymmetric models might be different: the 
mass-to-light ratio T of the fits is then constrained by the 
inner as well as the outer kinematics. Accordingly, the qual- 
ity of the overall fit could be less good. For example, in 
the self-consistent case T cannot be reduced arbitrarily to 
match low central (<r)o such as those arising in specific Y- 
projections, as otherwise the mass in the outer parts would 
be insufficient to fit the corresponding kinematics there. 
Consequently, the scatter in the relation shown in Fig. [17] 
could increase. This, in turn, would affect the conclusions 
drawn about the projection-dependency of the mass budget 
of the axisymmetric fits, because they are partly based on 
this relation. 

In summary, the luminous mass-to-light ratio, because 
it is the only parameter that controls the central mass, de- 
pends not primarily on the total projected energy (like the 
total mass). Instead it is more sensitive to the central orbital 
structure of the merger remnants. In particular the low Tfl t 
of Y and Z-models result from the lack of orbits that re- 
semble side-on views of box orbits, e.g. can support high 
flattening, large transversal motions and low central line- 
of-sight dispersions. The radial anisotropy induced in the 
axisymmetric models by the transversal streaming of box 
orbits in the remnants requires a lowering of Tat to keep 
the central dispersion low. In Y-models massive halos partly 
compensate for the low outer luminous masses that result 
from the low Tfit required for the central fit. These massive 
halos are needed to fit side-on views of outer Z-tubes and 
the related relatively large velocity dispersions. Z-models do 
not require massive halos, because when viewed face-on the 
outer Z-tubes produce relatively low dispersions that do not 
need particularly large masses to be fit. 



11 IMPLICATIONS FOR MODELS OF REAL 
GALAXIES 

In the last two Sees. [5] and \W\ we have followed the 
projection-trends in the recovery of total and luminous 
masses back to the restrictions imposed by axial symmetry. 
We now discuss very briefly some implications for models of 
real galaxies. 



11.1 A possible bias in the reconstruction of 
central masses 

The dependency of Tg t on the central kinematics of the 
merger remnants discussed in Sec. 1101 implies that Tfl t /Ti n 
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panel and the correlation between (<t)q and (-^4)0 shown in the 
top panel of Fig. 1231 Symbols as labelled in the bottom panel. 



is connected to {Hi)o and {a)o- Insofar as models and 
remnants are dominated by luminous mass near their cen- 
tres, one would expect that Tfi t /Tj n resembles the ra- 
tio M^ flt '/M' m - ) between reconstructed and input mass, 
evaluated at some radius near the centre. Consequently, 
M (flt) /M (in) should be connected to (H 4 )o and (a) as well. 

The relationship between M (flt) /M (in) (evaluated at 
0.2r e ff) and the central kinematical parameters {H^jo and 
(cr)o is shown in Fig. 1251 Because masses globally scale with 
velocities squared, we have chosen (<t)q as the ordinate in the 
top panel. Indeed the inner M^ flt - ) /M ( ' m - ) is closely correlated 
to (#4)0 and (er)o- Linear fits yield 

M (flt) 



and 

M (fit) 



= 44.33 x (a)l - 0.29 



= 1.2-13.1 x (#4)0, 



(21) 



(22) 



respectively (short-dashed lines in Fig. I25[) . The two re- 
lations should be almost equivalent, because according 
to Fig. [23] {Ha)o and (er)o are closely correlated in the 
merger remnants. In fact, the predicted relation between 
M (flt) /M (in) and (H 4 )o that derives from a linear fit to the 
top panel of Fig. [23] in combination with equation fl 2 1 p is 
consistent with the actual relation between the mass ratio 
and {Hijo that is shown in the lower panel of Fig. 1251 For 
comparison, this relation is indicated by the long-dashed 
line. 

Both quantities, (<r)o as well as (#4)0 are viewing-angle 
dependent. The case of the principal axes has already been 
discussed in Sec. 110.21 The full viewing-angle dependency of 
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Figure 27. Top: central ( #4)0 from random proje ctions of the 
merger remnant sample of Naab & Burkert] | |2003| ) (results are 
separated for different progenitor mass ratios); bottom: resulting 
central M( flt ) /M^ 1 ™) according to relation J22t . 



(Ha)q for all merger remnants of lNaab fc Burkertl (|2003l ) is 
shown in Fig. 1261 Thereby <P is defined as the azimuth in 
the (X, y)-plane and £ is the latitude. In accordance with 
our previous discussion (#4)0 peaks in Y and Z-projections 
and varies little with £ between these projections (prolate- 
ness). In the (X, Y)-plane (#4)0 decreases smoothly when 
approaching the long-axis projection. A similar behaviour 
of H4 with viewing-angle has been observed in iV-body bi- 
nary me rgers of discs with massive bulge s, but without dark 
matter (|Hevl. Hernquist fc Spergelll995l V 

If the here studied merger remnants would be seen at 
random projections on the sky, then the viewing-angle de- 
pendency of (#4)0 would give rise to the frequency distribu- 
tion shown in the top panel of Fig. 1271 Equal mass mergers 
would have on average the highest {1/4)0, because they are 
nearly prolate and the (ff 4 )o-distribution is dominated by 
the positive values around Y and Z-projections, respectively. 
Towards 4:1 mergers the average (-#4)0 decreases slightly. 

Assuming that relation (|22p holds for all viewing- angles 
and merger remnants, then the frequency distribution of 
(7^4)0 can be used to predict the distribution of recon- 
structed central M (flt) /M (in) . The latter is plotted in the 
lower panel of Fig. 1271 axisymmetric models of these merger 
remnants would be always biased towards too low central 
masses. The bias would be strongest for equal mass mergers, 
while masses of the more axisymmetric 4:1 mergers would 
be recovered better. 

Note, however, that for the construction of the lower 
panel of Fig. [57] we have assumed that the relation (|22|1 holds 
for models of all merger remnants and at all viewing angles. 
This needs to be verified on the basis of a broader sample 
of models. Likewise it is not clear whether the connection 
between (H 4 } and the central M (fit) /M (in) also holds for 
non-axisymmetric targets of more general shapes. 



11.2 A possible bias in the reconstruction of 
luminous mass-to-light ratios 

As stated above, if models and merger remnants are void of 
dark matter in their central regions, then M (flt) /M (in) w 
Tflt/Ti n . Accordingly, if the here analysed merger rem- 
nants would be seen at random projections on the sky, then 
Tflt/Ti n would be subject to a similar bias as the central 
M (flt) /M (in) . However, the relation between T fi t/T in and 
(^4)0, which determines the bias, is more scatterish than 
the one with the inner 

M (fit)y M (in) and it cannot he de _ 

scribed by a straight line (upper panel of Fig. I28[). The rea- 
son is that the total mass near the centre, say inside 0.2 r e g, 
is determined by the product of the central luminosity with 
Tat (and a negligible amount of central dark matter). The 
central luminosity however, is overestimated in X-models, 
but underestimated in Y and Z-models (cf. Sec. l5.1[) . Conse- 
quently, X-models with a specific T have larger M ( - flt ' ) /M^ 
than Y or Z-models with the same T. This is illustrated in 
the lower panel of Fig. 1281 

Regardless of the complicated shape of the relation be- 
tween {Ha)o and T nt /Ti n it is clear that reconstructed lumi- 
nous mass-to-light ratios of the here analysed merger rem- 
nants would be always underestimated. 

11.3 The mass recovery in models with T = Ti n 

If one is interested in the recovery of luminous mass-to-light 
ratios, one could use the relationship between {i? 4 )o and 
Tflt/Ti n (upper panel of Fig. I28|l to correct dynamically de- 
rived mass-to-light ratios (roughly) for the effects of intrinsic 
non-axisymmetry. However, the accuracy of this step is lim- 
ited by the systematics of the deprojections (lower panel of 
Fig. I28p . Another way to obtain unbiased luminous mass- 
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Figure 28. Top: (^4)0 versus Y nt /Yi n ; bottom: connection 
between the parameter ratio Y nt /Yj n and the actual ratio 
Af( flt )/M< in ) of integrated masses (luminous + dark) inside 
0.2r e ff. The dashed line marks the identity relation. 



to-light ratios would be to apply stellar population mod- 
els to line indices, although this requires knowledge of the 
initial-stellar-mass function. In any case it is interesting to 
ask, whether knowledge of the true Y could help to improve 
other aspects of the dynamical modelling, for example the 
recovery of the intrinsic anisotropy. To investigate this, we 
now compare Schwarzschild fits obtained under the condi- 
tion that the luminous mass-to-light ratio is fixed to its true 
value, Y = Yi n , with the merger remnants. Since most best- 
fit X-models with variable Y already have Y nt ~ Yi n to 
within ten percent (cf. Sec. 16. ip , fixing Y = Yi n does not 
change X-models significantly. We therefore skip X-models 
in the remainder of this section. 

Fig. [29] compares the intrinsic mass densities of Y- 
models with Y = Yi n to the merger remnants in a similar 
fashion as models with variable Y were compared in Fig. [5] 
The figure clearly shows an improvement in the mass re- 
covery when T is known. This holds for both, total as well 
as dark mass densities. Fig. 1301 covers the case of Z-models. 
Here, although again the mass recovery improves, the dis- 
crepancy between the density profiles of models and rem- 
nants in the outer parts remains. As it has already been dis- 
cussed in Sec. 15.11 the light-profile of Z-models differs from 
the remnants mainly in having the wrong slope. Knowing 
just the true scaling Tin cannot remove this mismatch. 



11.4 Anisotropy in models with Y = Yin 

Figs. [31] and [32] compare anisotropy profiles of Y and Z- 
models with Y = Yi n to the merger remnants. The general 
trend is that models with T = Yj n become strongly tan- 
gentially anisotropic (f3# < 0, /3 V < 0) in the outer regions 
(r > 0.5r e ff), especially Z-models. Towards the centres most 
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Figure 29. As Fig. [9] but for models with Y = Yi n 
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Figure 30. As Fig. [101 but for models with Y = T in . 



of the models shown in Figs. [31] and [32] become radially 
anisotropic (f3 v > 0), with a local peak around 0.1 — 0.3 r e ff . 

All in all then, fixing Y = Yi n improves the reconstruc- 
tion of the intrinsic mass structure, but deviations in internal 
velocity moments increase. 



12 SUMMARY 

We have modelled a set of collisionless disc-disc mergers with 
exactly the same axisymmetric orbit superposition program 
that has been used to model a sample of medium bright 



22 J. Thomas, R. Jesseit, T. Naab, R. P. Saglia, A. Burkert and R. Bender 




0.1 1 0.1 1 



r/r ,. 

' erf 

Figure 31. As Fig. 1121 but for models with T = T; n . Profiles of 
the bestfit models shown in Fig. 1121 are repeated for comparison 
(dotted). 
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Figure 32. As Fig. [31] but for Z-models. 

giant Coma ellipticals. The models assume a constant mass- 
to-light ratio for the luminous matter and a d ark halo of 
the NFW type (|Navarro. Frenk fc White] Il99rj ). The rem- 
nants we model result from the collisionless merger of pro- 
genitor systems composed of a bulge, a disc (both with the 
same mass-to-light ratio) and a dark halo. They are chosen 
to cover representatively th e range of intrin s ic sha pes and 
dynamical structures of the iNaab fc Burkert] (|2003h merger 
sample, including the most extreme cases. 

Intrinsic triaxiality causes a strong viewing-angle de- 



pendency of projected properties of the remnants. As in ax- 
isymmetric models many viewing angles are equivalent this 
must result in a corresponding dependency of the fits on the 
viewing angle of the remnant. The goal of this study is to 
investigate this dependency. 

Some projected properties of triaxial systems, for ex- 
ample minor-axis rotation or isophotal twists, are obvi- 
ously incompatible with oblate axial symmetry. Apart from 
these, we do not find any obvious mismatch between our 
Schwarzschild models and the merger remnants: residuals 
in the kinematic fits are smaller than typical observational 
errors. 

12.1 Remnant shapes and the mass recovery 

We find that the reconstruction of the total (luminous + 
dark) mass at the effective radius depends primarily on 
viewing-angle and not on the orbital structure of the merger 
remnants. This is so, because the global mass budget of 
the axisymmetric models is fixed by two constraints. Firstly, 
the match of the total line-of-sight energies of the luminous 
components of models and mergers, which is a consequence 
of the fit to the projected kinematics. Secondly, although 
the two transversal kinetic energies are not constrained in 
the models, the ratio of both has to be the same in mod- 
els and mergers, respectively: the ratio determines the flat- 
tening and is constrained by the fit to the luminosity den- 
sity. For edge-on models the restrictions imposed through 
axial symmetry then already fix the total mass budget of 
the Schwarzschild models. Thereby models of X-projections 
overestimate the true mass, models of Y-projections un- 
derestimate it on about the same level and models of Z- 
projections have masses even lower than those of Y-models. 
The exact amount of over or underestimation depends on 
the intrinsic shape of the merger remnants. 

In the here analysed merger remnants deviations of cu- 
mulative masses inside 1 — 2r c s are mostly below 20 percent. 
Extreme values of underestimations are larger than those 
of overestimations. The strongest underestimations occur 
among intrinsically very flattened, face-on systems, where 
the mass can be underestimated by up to 50 percent. The 
underestimation is due to a wrong inclination of the corre- 
sponding models, which arises from the fact that most rem- 
nants appear either flattened or show some residual rotation 
along the minor-axis, respectively, when viewed along their 
short axis. Both phenomena exclude face-on {i = 0°) oblate 
axisymmetric models, cause an underestimation of the lumi- 
nous kinetic energy and, hence, an underestimation of the 
mass inside the effective radius. 

The luminous mass-to-light ratio is always underesti- 
mated, Tfl t < Ti n . Unlike the total mass, it does not derive 
primarily from the total kinetic energy of the Schwarzschild 
model. Although T nt varies similarly with viewing-angle 
as the total mass, this variation is mediated by the cen- 
tral kinematics of the merger remnants. Box orbits cause 
a combination of high projected flattening and low line-of- 
sight motions (high central H4, low central a) in Y and Z- 
projections. Box orbits are mapped onto radial orbits in the 
Schwarzschild fits. The corresponding increase of the central 
velocity dispersion in the model then requires a lowering of 
the central mass to achieve a good match to both, the high 
central H4, and the low a. As the luminous mass-to- light ra- 
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tio Y nt is the only parameter that controls the central mass, 
Tat < Ti n in Y and Z-models (up to a factor of 2.5). Models 
of long-axis projections yield the best approximations to the 
mergers, in all but one case T nt w Tin to within 10 percent. 

The asymmetric motion of particles, especially those on 
inner box orbits, correlates central kinematical parameters 
like (cr)o and {H±)o with viewing- angle, a result that has al- 
rea dy been found in iV-body simulat ions without dark mat- 
ter (|Hevl. Hernquist fc Sp ergcl 1995|). The link between box 
orbit kinematics and T nt gives also rise to a correlation of 

the reconstructed central mass (relative to the merger mass) 
M (fit)/ M (m) with ^ H ^ o _ and 

viewing- angle. 
The deficit of luminous mass in models of projections 
along the intermediate axis is compensated for by dark halos 
that are more massive than in the remnants. Such massive 
halos are necessary to fit the relatively large outer disper- 
sions of these projections. Models of short-axis projections 
do not have massive dark halos: their outer dispersions are 
low. The different outer dispersions of projections along the 
intermediate and short axis, respectively, arise due to differ- 
ent views on minor-axis tubes dominating the outskirts of 
the remnants. 

If the luminous mass-to-light ratio is fixed to its true 
value, then the mass recovery of Y-models improves (on av- 
erage 6 percent accuracy at 2r e «). Models of Z- projections 
suffer from a wrong slope in the deprojection arising from 
the mismatch in the inclination already discussed above. As 
the inclination is wrong, mere knowledge of the true T does 
not improve the mass reconstruction of Z-models much. 

In general, improvements of the mass recovery are to 
the expense of strong outer tangential anisotropy in the 
Schwarzschild models, which weakens the match with the 
remnants' dynamics. 

12.2 Anisotropies in Schwarzschild models of 
mergers 

The viewing-angle dependency of merger projections in- 
duces a viewing-angle dependency of anisotropies in cor- 
responding Schwarzschild models. For example, X-tubes, 
when seen end-on, are nearly round and are represented 
by shell orbits in axisymmetric models, resulting in strong 
tangential anisotropy in the Schwarzschild fit. The same 
X-tubes seen side-on appear radially extended and are 
represented by radially extended orbits in the models, 
increasing their radial anisotropy. Because different or- 
bit famili es give rise to different viewin g-angle dependen- 
cies (e.g. IJesseit. Naab fc Bur kcrt 2005]) the anisotropy of 
Schwarzschild fits depends on projection as well as on the 
orbital make-up. 

Z-models of intrinsically flat, disc-like merger remnants 
are dominated by meridional motions (o» > ay). This can 
be explained as an inclination effect: as stated above, the 
flattening of most remnants requires an inclination i > 0° in 
the axisymmetric models. In this case, Z-tubes dominating 
the outskirts of the remnant project to nearly round shapes 
and are mapped onto axisymmetric shell orbits of the same 
shape - causing a meridional anisotropy. 

As found for the mass reconstruction, modeling the 
long-axis projection of a merger remnant yields a better 
match to the intrinsic structure, while the largest deviations 
between remnants and models appear among the short-axis 



models. In any case, deviations A/3 between Schwarzschild 
fits and merger remnants are below A/3 < 0.2 at most radii. 
Towards the centre and/or towards the outer regions devia- 
tions can be larger, however. 

12.3 Real galaxies 

We have tested our axisymmetric orbit superposition code 
on a sample of rather extreme merger remnants, covering a 
wide range of non-axisymmetric as well as axisymmetric, but 
highly flattened, dynamical systems. The aim was to probe 
the limits of the method. If real ellipticals would resemble 
the here considered merger remnants in terms of their or- 
bital structure, then random viewing angles would provoke 
scatter in anisotropies of axisymmetric dynamical models. 
Furthermore, dynamically derived stellar mass-to-light ra- 
tios would be on average underestimated and the amount of 
underestimation would be correlated with the central value 
of H4. We plan a detailed comparison of the here described 
models of merger remnants with models of Coma ellipti- 
cals in a future publication. If models of real ellipticals show 
less scatter in anisotropies or if there is no sign for a system- 
atic underestimation of stellar mass-to-light ratios, then this 
would be an indication for their intrinsic shapes to be closer 
to axial symmetry than in our iV-body merger sample. 

This would have two consequences. Firstly, axisym- 
metric galaxies would be recovered with higher accuracy, 
even better than the here analysed merger remnants whose 
masses and anisotropies at the effective radius are mostly 
well matched. This holds especially for flattened and rotat- 
ing systems, which are known to be close to edge-on. The 
case of round and non-rotating galaxies (even if they are 
axisymmetric) is more ambigous as the inclination of these 
systems is apriori unknown. 

Secondly, knowing that galaxies are close to axisymme- 
try provides clues about their formation: dissipation during 
the formati on can change el l iptical galaxy properti es signif- 
icantly fe.g. ICox et a l. 2006; Robe rtson et al.ll2006h . Partic- 
ularly, i t can drive the final object towards axial symme- 
try (e .g. lBarnes fc Hernqu ist 1996; Naa b. Jesseit fc Burkertl 
l2006f ). We therefore also plan to extend our study to binary 
disc mergers including gas physics, star formation and feed- 
back from a central black hole as well as on simulations from 
cosmological initial conditions. 
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APPENDIX A: iV-BODY REALISATIONS OF 
SCHWARZSCHILD MODELS 

In the following we describe the construction of JV-body re- 
alisations of Schwarzschild models. Thereby we assume that 
all particles have the same mass, e.g. that there is a global 
constant of proportionality linking the number density of 
particles in phase-space to the phase-space mass density. To 
simplify the notation, we also assume T = 1. The general- 
isation to particle masses varying from orbit to orbit or to 
systems with T 7^ 1 is straight forward. 

The orbit library of any Schwarzschild model provides 
a natural partition of phase-space into cells dV(i, j), where 
dV(i,j) is the phase-space volume of the cell that orbit i 
covers during time-step dt± . The probability p(i,j) to find 
a particle in cell dV(i,j) is proportional to 



P(i,j) « fidV(i,j), 



(Al) 



where fi is the phase-space density along orbit i. The lat- 
ter is related to the weight Wi of the corresponding orbit 
and its total phase -space volume Vi via fi = Wi/Vi (e.g. 
iThomas et al.l l2004). According to the time-averages theo- 



dV(i, j) ^ dt 



V, 



(A2) 



where n is the total integration time of orbit i. Hence, the 
probability to find a particle in phase-space cell dV(i, j) is 



p(i,j) = Wi 



dt 



(A3) 



If the luminosity is normalised such that ^ Wi = 1 then the 
p(i,j) of equation (|A3|) provide a complete partition of the 
interval [0, 1]: 

[0,1]= |J (J P(i,j), (A4) 
i=i j=i 

where 7V or b is the number of orbits in the library, N t (i) is 
the number of time-steps in the integration of orbit i and 

r [cuj, hj ) : 1 sC i «S Norb, 1 < j < N t (i), 
V(i,j) = < (i,j)^(N olb ,N t (N olh )) (A5) 

t [a,ij,bij] : i = iVorb, j = N t {N orh ) 

with 

i-i ^ 
11,,= > w fe + > Wi — - — (A6) 



w k + y^ Wi — 2 — 

fc=0 m = 



and 



bij = w fc + 



di„ (m) 



(A7) 

>i 

k-0 m=0 

(wq = and dt^ =0, 1 ^ i ^ JV or i,). The length of each 
subinterval equals the probability to find a particle in phase- 
space cell dV(i,j): 
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Figure Al. Correlation between the virial ratio 2T/\W\ and the 
halo-concentration in the Schwarzschild models. 
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Table Bl. Accuracy of reconstructed masses. (1) Merger rem- 
nant; (2-4) reconstructed Tfl t /Tj n for X, Y and Z-models, re- 
spectively; (5-7) fractional error AM = (Af< flt ) - A/< in ))/M< in ) 
of reconstructed (luminous + dark) mass inside r e ff for X, Y and 
Z-models, respectively. 



APPENDIX B: DATA AND MODELS 

Figs. lBl l- IB6l survev the kinematics of the individual targets 
(dots with error bars) and the model fits (lines). For each tar- 
get the bestfit model (solid; best-fit luminous mass-to-light 
ratios Tgt/Tm are given in the captions and summarised in 
Tab. IB 1 jl as well as the bestfit with T = Ti n (dashed) are 
shown. Tab. IB 1 1 also lists the accuracy of reconstructed total 
masses at r e g. 



bij - Ojj = to, — — = p(i,j). (A8) 

Ti 

An TV-body realisation with Ni particles of, say, the lu- 
minous component can now be constructed by choosing iVj 
random numbers k € [0, 1]. Each k falls into one subinterval 
V(ik,jk)- Accordingly, particle k has to be dropped on orbit 
ik during time-step jt- The A^-body realisation of the dark 
halo can be constructed in a similar way, provided corre- 
sponding orbital weights are given. For the discussion in this 
paper we only need an A-body representation of the dark 
halo density profile, not of the halo kinematics. Thus, we can 
choose any distribution function for the halo that supports 
its density profile. The A-body realisations of the halos in 
this work have been calculated from orbit weights that max- 
imise the entropy of the dark m atter distribution fun ction. 
Their calculation is described in iThomas et all (|2007n . We 
use Ni — Nd = 50000 particles to sample the Schwarzschild 
models up to 10r e ff. 

Cutting the A-body realisation beyond 10r o fr intro- 
duces a spurious correlation between the virial ratio 2T/\W\ 
and the halo-concentration Chaio in the Schwarzschild fits. 
This is shown in Fig. IA1I the less concentrated the halo, 
the larger 2T/\W\. The systematic offset 2T/\W\ > 1 of the 
A^-body realisation is thereby most likely caused by the fact 
that the halo is undersampled if the concentratation is low 
(chaio ^ 13) and the halo-scaling radius is correspondingly 
large. Then, the potential well is too shallow and \W\ is 
underestimated. For the deviations from virial equilibrium 
discussed in Sec. 18.31 it follows that they are mostly an ar- 
tifact of the A'-body realisation and not due to intrinsic 
non-stationarity of the models. 
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Figure Bl. Comparison of model kinematics (solid lines: best- 
fit; dashed lines Tfj t = T; n ) and input data (dots with error 
bars) for the TRIAX target. Top panel: major-axis; bottom panel: 
minor-axis. In each panel, from top to bottom v, a, H3 and 
H4; left: X-projection; middle: Y-projection; right: Z-projection. 
The luminous mass-to-light ratios of the Schwarzschild fits are 
Yfit/TTm = 0.9, 0.7, 0.4 (X,Y and Z-model, respectively). 



Figure B2. As Fig.|ET] but for the PROLATE target. T flt /T in 
0.9, 0.6, 0.8 (X,Y and Z-model, respectively). 
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Figure B3. As Fig. ED but for the ROUND target. T flt /T in 
0.9, 0.8, 0.7 (X,Y and Z-model, respectively). 



Figure B4. As Fig. |ET] but for the FLAT target. T Ht /T in 
0.9, 0.7, 0.6 (X,Y and Z-model, respectively). 
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Figure B5. As Fig. EH but for the ELONG target. Tflt/Tiu 
0.9, 0.6, 0.5 (X,Y and Z-model, respectively). 



Figure B6. As Fig.[5T] but for the OBLATE target. T flt /T in 
0.7, 0.9, 0.7 (X,Y and Z-model, respectively). 



